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The determination of the nucleotide sequ- 
ence of HTLV-III DNA; identification, isolation 
and expression of HTLV-III sequences which 
encode immunoreactive polypeptides by re- 
combinant DNA methods and production of 
viral RNA are disclosed. Such polypeptides can 
be employed in immunoassays to detect 
HTLV-III. 
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Technical Fields 

This invention is in the f ields of molecular biology 
and virology and in particular relates to human T cell 
leukemia virus - type III (HTLV-III). 

Background 

The term human T cell leukemia-lymphoma virus 
(HTLV) refers to a unique family of T cell tropic retro- 
viruses. These viruses play an important role in the 
pathogenesis of certain T cell neoplasms. There are 
presently three known types of HTLVs. One subgroup 
of the family, HTLV-type I (HTLV-I), is linked to the 
cause of adult T-cell leukemia-lymphoma (ATLL) that 
occurs in certain regions of Japan, the Caribbean and 
Africa. HTLV-type II (HTLV-II) has been isolated from 
a patient with a T-cell variant of hairy cell leukemia. 
M.Popovicetal., Detection, Isolation, and Continuous 
Production of Cytopathic Retroviruses (HTLV-III) 
from Patients with AIDS and Pre-AIDS. Science , 
224:497-500 (1984). 

HTLV-type III (HTLV-III) has been isolated from 
may patients with acquired immunodeficiency syn- 
drome (AIDS). HTLV-III refers to prototype virus iso- 
lated from AIDS patients. Groups reported to be at 
greatest risk for AIDS include homosexual or bisexual 
males; intravenous drug users and Haitian immi- 
grants to the United States. Hemophiliacs who re- 
ceive blood products pooled from donors and reci- 
pients of multiple blood transfusions are also at risk. 
Clinical manifestations of AIDS include severe, unex- 
plained immune deficiency which generally involves 
a depletion of helper T lymphocytes. These may be 
accompanied by malignancies and infections. The 
mortality rate for patients with AIDS is high A less se- 
vere form of AIDS also exists, in which there may be 
lymphadenopathy and depressed helper T cell 
counts; there is not, however, the devastating illness 
characteristic of full-blown AIDS. There are many in- 
dividuals, who are classified as having early AIDS 
(pre- AIDS), who exhibit these signs. It is not now pos- 
sible to predict who among them will develop the 
more serious symptoms. 

Much of the evidence implicates HTLV-III as the 
etiological agent of the infectious AIDS. First, there is 
consistent epidemiology; greater than 95% of the pa- 
tients with AIDS have antibodies specif ic for HTLV-III. 
Second, there has been reproducible identification 
and isolation of virus in this disease; more than 100 
variants of HTLV-III have been isolated from AIDS pa- 
tients. Third, there has been transmission of the dis- 
ease to normal healthy individuals who received 
blood transfusions from infected blood donors. 

HTLV-III has been shown to share several prop- 
erties with HTLV-I and HTLV-II but also to be morpho- 
logically, biologically and antigenically distinguish- 
able. R.C. Gallo et al., Frequent Detection and Isola- 



tion of Cytopathic Retroviruses (HTLV-III) from Pa- 
tients with AIDS and at Risk for AIDS. Science , 
224:500-501. (1984). For example, HTLV-III has been 
shown to be antigenically related to HTLV-I and HTLV- 

5 II by demonstrating cross-reactivity with antibodies to 
HTLV-I and HTLV-II core proteins, p24 and p19, and 
envelope antigens and by nucleic acid cross- 
hybridization studies with cloned HTLV-I and HTLV-II 
DNAs. However, unlike HTLV-I and HTLV-II, it lacked 

10 the ability to infect and transform T cells from normal 
umbilical cord blood and bone marrow in vitro , and 
has the cytopathic effect on infected cells only. 

Like the RNA genome of other retroviruses, the 
RNA genome of HTLV-III contains three genes which 

15 encode viral proteins: 1 ) the gag gene, which encodes 
the internal structural (nucieocapsid or core) pro- 
teins; 2) the pol gene, which encodes the RNA-direct- 
ed DNA polymerase (reverse transcriptase); and 3) 
the env gene, which encodes the envelope glycopro- 

20 teins of the virion. In addition, the HTLV-III genome 
contains a region designated Px, located between the 
env gene and the 3' LTR, which appears to be in- 
volved in functional killing of the virus. 

At this time, AIDS is still difficult to diagnose be- 

25 fore the onset of clinical manifestations. There is no 
method presently available for the prevention of the 
disease. Treatment of those with AIDS is generally 
not successful and victims succumb to the devastat- 
ing effects HTLV-III has on the body. 

30 

Summary of the Invention 

This invention is based upon applicant's cloning 
of HTLV-III DNA in recombinant/vector host systems 

35 capable of expressing immunoreactive HTLV-III poly- 
peptides. Based on the cloning of HTLV-III DNA in 
systems which express immunoreactive- polypep- 
tides, applicant has developed methods useful in the 
diagnosis, treatment and prevention of AIDS. Appli- 

40 cant has developed methods of detecting HTLV-III 
and antibodies against HTLV-III in body fluids (e.g. 
blood, saliva, semen), and methods useful in immu- 
notherapy (e.g. vaccination and passive immuniza- 
tion against AIDS). In addition, applicant has devel- 

45 oped methods of making HTLV-III DNA probes and 
RNA probes useful in detecting HTLV-III in body flu- 
ids. 

Polypeptides encoded by segments of the HTLV- 
III genome have been produced by these recombi- 

50 nant DNA methods. For example, polypeptides en- 
coded by three regions of the HTLV-III genome (an 
env gene sequence, an env-lor gene sequence and a 
1 . 1 Kb EcoRI restriction fragmentfrom HTLV-III cDNA) 
have been produced. The polypeptides expressed 

55 have been isolated. These polypeptides are immuno- 
reactive with sera of patients having AIDS and with 
antibodies to HTLV-III and thus are useful in screen- 
ing blood and other body flu ids for the presence of an- 
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tibodies against HTLV-III. Applicant's invention there- 
fore provides a method not only for diagnosis AIDS, 
but also for preventing the transmission of the dis- 
ease to others through blood or blood components 
harboring HTLV-III. The latter is particularly valuable 
in screening donated blood before it is transfused or 
used to obtain blood components (e.g., Factor VIII for 
the treatment of hemophilia; Factor IX) 

Polypeptides produced by the recombinant DNA 
methods are employed in the production of antibo- 
dies, including monoclonal antibodies, against the vi- 
rus. Such antibodies form the basis for immunoassay 
and diagnostic techniques for directly detecting HTLV- 
III, in body fluids such as blood, saliva, semen, etc. 
Neutralizing antibodies against the virus may be used 
to passively immunize against the disease. 

Applicant's cloning of HTLV-III DNA in such re- 
combinant vector host systems also provides the ba- 
sis for determination of the nucleotide sequence of 
HTLV-III DNA. The DNA probes are homologous to 
DNA regions which are unique to the HTLV-III gen- 
ome. DNA probes provide another method of detect- 
ing HTLV-III in blood, saliva or other body fluids. RNA 
probes which contain regions unique to the HTLV-III 
genome can also be formed and used for the detec- 
tion of HTLV-III in body fluids. 

Brief Description of the Figures 

Figure 1 is a representation of HTLV-III DNA. fig- 
ure 1 a shows sites at which the genome is cut by the 
restriction enzyme Sstl and Figure 1 b shows the frag- 
ments of HTLV-III genome produced through the ac- 
tion of restriction enzymes Kpn, EcoRI and Hind III. 

Figure 2 is a representation of HTLV-III DNA. Fig- 
ure 2a shows the location of restriction enzyme sites 
in the genome and Figure 2b shows the location in the 
HTLV-III genome of DNA inserts in open rending 
frame clones. The (+) and (-) indicate reactivity and 
lack of reactivity, respectively, of the fusion protein 
expressed by cells transformed by the ORF vectors 
with sera of AIDS patients. 

Figure 3 shows the nucleotide sequence for 
HTLV-III DNAand the predicted amino acid sequence 
of the four longest open reading frames. Restriction 
enzyme sites are indicated above the nucleotide se- 
quence. 

Figure 4 is an immunoblot showing the position 
on an SDS polyacrylamide gel of HTLV-III env-Beta- 
galactosidase fusion proteins. 

Figure 5 shows sites at which the genome is cut 
by the restriction enzyme EcoRI and construction of 
recombinant plasmids carrying HTLV-III DNA. 

Figure 6 is an immunoblot showing the positions 
on nitrocellulose blots of peptides produced by bac- 
terial cells transformed by recombinant construct 
ompA1-R-6; ompA2-R-7 and ompA3-R-3, into which 
a 1 .1 Kb EcoRI HTLV-III cDNA restriction fragment had 
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been inserted. Figure 6a shows the nucleotide se- 
quence of the ompA signal peptide and the pertinent 
region of recombinant plasmids ompA1-R-6; ompA2- 
R-7 and ompA3-R-3. 

5 Figure 7 is an immmunoblot showing blocking of 

reaction between HTLV-III antigens and an AIDS ser- 
um by lysates of E.coli containing HTLV-III DNA re- 
combinant plasmid ompA1-R-6 (lanes 1-5) and no 
blocking of the reaction by lysates of E.coli control 

w cells (lanes 6-10). 

Figure 8 is an immunoblot showing the presence 
or absence of antibodies against the peptide encoded 
by the 1.1Kb EcoRI HTLV-III restriction fragment of 
HTLV-III cDNA in sera from healthy individuals (lanes 

15 1-3) and from AIDS patients (lanes 4-11). Purified 
HTLV-III virus (panel A) or total cell lysate of bacterial 
clone ompA1-R-6(01R6) were reacted with sera 
samples. 

Figure 9 represents the open reading frame ex- 
20 pression vector pMRlOO having HTLV-III DNA. 

Figure 10 represents lambdaCI-HTLV-lll beta- 
galactosidase fusion proteins. Figure 10a is an immu- 
noblot showing the position on SDS polyacrylamide 
gel of lambdaCI-HTLV-lll beta-galactosidase fusion 
25 proteins, and Figure 10b shows the immunoreactivity 
of such proteins with sera from AIDS patients. 

Best Mode of Carrying Out the Invention 

30 Despite the similarity between HTLV-III and the 

other members of the HTLV-bovine leukemia virus 
(BLV) family of viruses, the biology and pathology of 
HTLV-III differs substantially. For example, relatively 
little homology has been found in the HTLV-III gen- 

35 ome when compared with that of the HTLV-I or -II 
genome. Infection with HTLV-III often results in pro- 
found immunosuppression (AIDS), consequent to the 
depletion of the OKT4(+) cell population. This effect 
is mirrored by a pronounced cytopathic, rather than 

40 transforming, effect of HTLV-III infection upon the 
OKT4(+) cells in lymphocyte cultures in vitro . In con- 
trast, infection with HTLV-I results in a low incidence 
of T-cell leukemia lymphoma (an OKT4(+) cell malig- 
nancy). There is evidence for some degree of immu- 

45 nodef iciency in HTLV-I patients as well. Infection of 
primary lymphocytes in culture by HTLV-I and -II re- 
sults in vitro transformation of predominantly 
OKT4(+) cells. A cytopathic effect of HTLV-I infection 
upon lymphocytes is apparent, but the effect is not as 

50 pronounced as that observed for HTLV-III. 

HTLV-III also differs from HTLV-I and -II in the ex- 
tent of infectious virion production in vivo and in vitro . 
High titers of cell free, infectious virions can be ob- 
tained from AIDS patient semen and saliva and from 

55 the supernatant of cultures infected with HTLV-III. 
Very few, if any, cell free infectious virions can be re- 
covered from adult T-cell leukemia lymphoma (ATLL) 
patients or from cultures infected with HTLV-I or -II. 
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Envelope glycoprotein is the major antigen rec- 
ognized by the antiserum of AIDS patients. In this re- 
spect, HTLV resembles other retroviruses, for which 
the envelope glycoprotein is typically the most anti- 
genic viral polypeptide. In addition, the neutralizing 
antibodies are generally directed toward the envelope 
glycoprotein of the retrovirus. Serum sample from 88 
percent to 1 00 percent of those with AIDS have been 
shown to have antibodies reactive with antigens of 
HTLV-ill; the major immune reactivity was directed 
against p41 , the presumed envelope antigen of HTLV- 
III. Antibodies to core proteins have also been dem- 
onstrated In serum of AIDS patients, but do not ap- 
pear to be as effective an indicator of infection as is 
the presence of antibodies to envelope antigen. 

The p41 antigen of HTLV-III has been difficult to 
characterize because the viral envelope is partially 
destroyed during the process of virus inactivation and 
purification. This invention responds to the great 
need to characterise this antigenic component of the 
HTLV-III virus and to determine the existence and 
identity of other viral antigenic components in several 
ways. It provides products, such as HTLV-III polypep- 
tides, antibodies to the polypeptides and RNA and 
DNA probes, as well as methods for their production. 
These serve as the basis for screening, diagnostic 
and therapeutic products and methods. 

This invention relates to HTLV-III polypeptides 
which are produced by translation of recombinant 
DNA sequences encoding HTLV-III proteins. Polypep- 
tides which are produced in this way and which are 
immunoreactive with serum from. AIDS patients or 
antibodies to HTLV-III are referred to as recombinant 
DNA-produced immunoreactive HTLV-III polypep- 
tides. They include, but are not limited to, antigenic 
HTLV-III core and envelope polypeptides which are 
produced by translation of the recombinant DNA se- 
quences specific to the gag and the env DNA se- 
quences encoding HTLV-III core proteins and envel- 
ope glycoproteins, respectively. They also include the 
polypeptides which are produced by translation of the 
recombinant DNA sequences included in a 1.1Kb 
EcoRI restriction fragment of HTLV-III cDNA and re- 
combinant DNA sequences specific to the sor gene 
and the Px genes of HTLV-III. The sor DNA sequence 
is common to replication competent HTLV-III viruses. 
The Px genes contain a coding sequence with one 
large open reading frame (lor), located between the 
env gene and the 1" end of the HTLV-III genome. Both 
the env DNA sequences and the lor DNA sequences 
are located within the same open reading frame of the 
HTLV-III genome and this gene region is accordingly 
designated env-lor . 

The polypeptides encoded by these regions of 
the HTLV III can be used in immunochemical assays 
for detecting antibodies against HTLV-III and HTLV- 
VIII infection. These methods can assist in diagnosing 
AIDS. In addition, they can also be employed to 



screen blood before it is used for transfusions or for 
the production of blood components (e.g., Factor VIII 
for the treatment of hemophilia). Availability of 
screening technics will reduce the risk of AIDS trans- 

5 mission. 

Detection of antibodies reactive with the polypep- 
tides can be carried out by a number of established 
methods. For example, an immunoreactive HTLV ill 
polypeptide can be affixed to a solid phase (such as 

10 polystyrene bead or other solid support). The solid 
phase is then incubated with blood sample to be test- 
ed for antibody against HTLV-III. After an appropriate 
incubation period the solid phase and blood sample 
are separated. Antibody bound to the solid phase can 

15 be detected with labeled polypeptide or with a labeled 
antibody against human immunoglobulin. 

HTLV-III polypeptides can be used in a vaccine 
prevention of AIDS. For vaccination against the virus, 
immunogenic polypeptides which elicit neutralizing 

20 antibody would be employed. The leading candidates 
for use in vaccines are the viral envelop polypeptides. 

The polypeptides can also be used to produce an- 
tibodies, including monoclonal antibodies, against 
the HTLV-III polypeptides. These antibodies can be 

25 used in immunochemical assays for direct detection 
of the virus in body fluids (such as blood, saliva and 
semen). Assays employing monoclonal antibody 
against specific HTLV III antigenic determinants will 
reduce false- positive results thereby improving accu- 

30 racy of assays for the virus Antibodies against the vi- 
rus may also be useful in immunotherapy. For exam- 
ple, antibodies may be used to passively immunize 
against the virus. 

The methods of producing the polypeptides are 

35 also a subject of this invention, as are diagnostic 
methods based on these polypeptides. 

This invention also provides methods for the iso- 
lation of genes of HTLV-III which encode immuno-re- 
active polypeptides; identification of the nucleotide 

40 sequence of these genes; introduction of DNA se- 
quences specific to these viral DNA sequences into 
appropriate vectors to produce viral RNA and the for- 
mation of DNA probes. These probes are comprised 
of sequences specific to HTLV-III DNAand are useful, 

45 for example, for detecting complementary HTLV-III 
DNA sequences in body fluids (e.g., blood). 

HTLV-III POLYPEPTIDES 

so Genetic engineering methods are used to isolate 

segments of HTLV-III DNA which encode immunor- 
eactive HTLV-III polypeptides. Among these are poly- 
peptides which are immunoreactive with serum from 
AIDS patients or antibodies to HTLV-III. These poly- 

55 peptides include the core protein, a 15Kd peptide en- 
coded by a 1.1 K.b EcoRI HTLV-III restriction fragment 
of HTLV-III DNA and the envelope glycoprotein. 
These methods are also used to sequence the frag- 
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merits which encode the polypeptides. The proviral 
genes integrated into host cell DNA are molecularly 
cloned and the nucleotide sequences of the cloned 
provirus is determined. 

An E. coli expression library of HTLV-III DNA is 
constructed. The HTLV-III genome is cloned and cuts 
are then made in the cloned HTLV-III genome with re- 
striction enzymes to produce DNA fragments. (Fig- 
ures 1 and 2) HTLV-III DNA fragments of approximate- 
ly 200-500bp are isolated from agarose gel, end re- 
paired with T 4 polymerase and ligated to linker DNA. 
The linker ligated DNA is then treated with a restric- 
tion enzyme, purified from agarose gel and cloned in 
an expression vector. Examples of the expression 
vectors used are: OmpA, pIN (A,B and C), lambda pL, 
T7, lac, Trp, ORF and lambda gt11. In addition, mam- 
malian cell vectors such as pSV28pt, pSV2neo, 
pSVdhfr and VPV vectors, and yeast vectors, such as 
GALI and GAL10, may be used. 

The bacterial vectors contain the lac coding se- 
quences, into which HTLV-III DNA can be inserted for 
the generation of B-galactosidase fusion protein. The 
recombinant vectors are then introduced into bacteria 
(e.g., E.coli) ; those cells which take up a vector con- 
taining HTLV-III DNA are said to be transformed. The 
cells are then screened to identify cells which have 
been transformed and are expressing the fusion pro- 
tein. For example, the bacteria are plated on MacCon- 
key agar plates in order to verify the phenotype of 
clone. If functional B-galactosidase is being pro- 
duced, the colony will appear red. 

Bacterial colonies are also screened with HTLV- 
III DNA probes to identify clone: containing the DNA 
regions of interest (e.g., HTLV-III gag pol and env DNA 
sequences). Clones which are positive when 
screened with the DNA probe and positive on the 
MacConkey agar plates are isolated. 

This identification of cells harboring the HTLV-III 
DNA sequences makes it possible to produce HTLV- 
III polypeptides which are immunoreactive with HTLV- 
III specific antibody. The cells from the selected col- 
onies are grown in culture under conditions allowing 
the expression of the hybrid protein. Cell protein is 
then obtained by means known in the art For exam- 
ple, the culture can be centrrfuged and the resulting 
cell pellet broken. Polypeptides secreted by the host 
cell can be obtained (without disruption of the cells) 
from the cell culture supernatant. 

The total cellular protein is analyses by being run 
on an SDS polyacrylamide gel electrophoresis. The 
fusion proteins are identified at a position on the gel 
which contains no other protein. Western blot analys- 
es are also carried out on the clones which screened 
positive. Such analyses are performed with serum 
from AIDS patients, with the result that it is possible 
to identify those clones expressing HTLV-III B-galac- 
tosidase fusion proteins (antigens) that cross-react 
with the HTLV-III specific antibody. Lambda 10 clones 



harboring HTLV-III DNA are cloned from the replicat- 
ed form of the virus. As the retrovirus is replicating, 
double stranded DNA is being produced. The cloned 
HTLV-III DNA is digested with the restriction enzyme 

5 Sstl. (Figure 1a) Because there are two Sstl recogni- 
tion sites within the LTR of HTLV-III DNA one LTR re- 
gion is not present in the cloned DNA sequence re- 
moved from the lambda 10 vector. As a result, a small 
(approximately 200 bp) fragment of the HTLV-III DNA 

w is missing. 

The resulting DNA is linearized and fragments are 
produced by digesting the linearized genomic DNA 
spanning the env gene region with restriction en- 
zymes. For example, fragments are produced using 

15 Kpn or EcoRI plus Hind III, as shown in Figure 1 b. The 
resulting 2.3kb Kpnl-Kpnl fragments; LOkbEcoRI- 
EcoRI fragments and 2.4 EcoRI-Hindlll fragments are 
solated by gel electrophoresis and electro-elution. 
These fragments are randomly sheared to produce 

20 smaller fragments. The fragments thus produced are 
separated from agarose gel and DNA fragments be- 
tween about 200-500 are eluted. 

The eluted 200-500bp DNA fragments are end f il- 
led through the use of E. coli T 4 polymerase and blunt 

25 end ligated into an open reading frame expression 
(ORF) vector, such as pMRIOO. This ligation may oc- 
cur at the Smal site of the pMRIOO vector, which con- 
tains two promoter regions, hybrid coding sequences 
of lambdaCI gene and lacl-LacZ gene fusion se- 

30 quence. In the vector, these are out of frame se- 
quences; as a result, the vector is nonproductive. The 
HTLV-III DNA is inserted into the vector; the correct 
DNA fragments will correct the reading frame, with 
the result that CI-HTLV-lll-B-galactosidase fusion 

35 proteins are produced. The expression of the hybrid 
is under the control of the lac promoter. Based on the 
sequence of pMRIOO, it appears that if a DNA frag- 
ment insert cloned into the Smal site is to generate a 
proper open reading frame between the lambdaCI 

40 gene fragment and the lac-2 fragment, the inserted 
DNA must not contain any stop codons in the reading 
frame set by the frame of the lambdaCI gene. 

The recombinant pMRIOO vectors are than intro- 
duced into E. coli. The bacteria are plated on Mac- 

45 Conkey agar plates to verify the phenotype of the 
clone. If functional B-galactosidase is being pro- 
duced, the colomly will appear red. The colonies are 
also screened with HTLV-III DNA probes, for the pur- 
pose of identifying those clones containing the insert. 

50 Clones which are positive when screened with the 
DNA probe and positive on the MacConkey agar 
plates are isolated. 

The cells from the selected colonies are grown in 
culture. The culture is spun down and the cell pellet 

55 broken. Total cellular protein is analysed by being run 
on an SDS polyacrylamide gel. The fusion proteins 
are identified at a position on the gel which contains 
no other protein. (Figure 4) 
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Western blot analyses are also carried out on the 
clones which screened positive. Sera from AIDS pa- 
tients are used, thus making it possible to identify 
those clones which express the HTLV-III-D D-galac- 
tosidase fusion proteins that cross-react with the s 
HTLV-III specific antibody. 

1000 clones were screened by this method; 6 were 
positive. 

Because of the nature of the pMR100 cloning ve- 
hicle, a productive DNA insert should also be ex- 10 
pressed as a part of a larger fusion polypeptide. 
HTLV-III envgene containing recombinant clones was 
identified by colony hybridization. The production of 
largerfusion polypeptides bearing functional B-galac- 
tosidase activity was verified by phenotype identif ica- 1 5 
tion on MacConkey agar plates; by B-galactosidase 
enzymatic assays and by analysis on 75% SDS-poly- 
acrylamide gels. Immunoreactivity of the larger pro- 
tein with antibody to HTLV-III was assessed by west- 
ern blot analysis using serum from AIDS patients. 20 
These large fusion proteins also reacted with anti-B- 
galactosidase and anti-CI antiserum. This finding is 
consistent with the hypothesis that they are proteins 
ofCI-HTLV-lll-laclZ. 

The open reading frame insert fragment of HTLV- 25 
III is further analyzed by DNA sequencing analysis. 
Because one of the two BamHI sites flanking the 
Smal cloning site in pMR100 is destroyed in the clon- 
ing step, positive clones are digested with restriction 
enzymes Hindlll and clal to liberate the inserted 30 
HTLV-III DNAfragmenL The HTLV-III ORF inserts are 
isolated from the fusion recombinant and cloned into 
M13 sequencing cloning vector mp18 and mp19 di- 
gested with Hindlll and Accl. DNA sequences of the 
positive ORF clones are then determined. 35 

Fragments of HTLV-III DNAof approximately 200- 
500 bps are isolated from agarose gel, end repaired 
with T 4 polymerase and ligated to EcoRI linker. The 
EcoRI linker ligated DNA is then treated with EcoRI 
purified from 1 % agarose gel and cloned in an expres- 40 
sion vector, lambda gt11. This vector contains lac Z 
gene coding sequences into which the foreign DNA 
can be inserted for the generation of B-galactosidase 
fusion protein. The expression of the hybrid gene is 
under the control of lac repressor. The lac repressor 45 
gene, lac I, is carried on a separate plasmid pMC9 in 
the host cell, E. coli Y1 090. AIDS patient serum was 
used to probe the Iambdagt11 library of HTLV-III gen- 
ome DNA containing 1.5x10* recombinant phage. In 
a screen of 5000 recombinants, 100 independent so 
clones that produced strong signals were isolated. 
The positive recombinant DNA clones were further 
characterized for their specific gene expression. Rab- 
bit hyperimmune serum against P24 was also used to 
identify the gag gene specific clones. Nick-translated 55 
DNA probes of specif ic HTLV-III gene, specifically the 
gag gene, env gene and Pxgene were used to group 
the positive immunoreactive clones into specific gene 



region. 

Recombinant clones that produced strong sig- 
nals with AIDS serum and contain insert DNA span- 
ning the HTLV-III gag, pol, sor and env-lor gene re- 
gions were examined in detail by mapping their insert 
with restriction enzymes and DNA sequencing analy- 
sis. 

Determination of the Nucleotide Sequence of HTLV- 
III DNA 

Genetic engineering methods are used to deter- 
mine the nucleotide sequence of HTLV-III DNA. One 
technique that can be used to determine the se- 
quence is a shotgun/random sequencing methods. 
HTLV-III DNA is sheared randomly into fragments of 
about 300-500 bp in size. The fragments are cloned, 
for example, using ml3, and the colonies screened to 
identify those having an HTLV-III DNA fragment in- 
sert. The nucleotide sequence is then generated, 
with multiple analysis producing overlaps in the se- 
quence . Both strands of the HTLV-III DNA are se- 
quenced to determine orientation. Restriction map- 
ping is used to check the sequencing data generated. 

The nucleotide sequence of one cloned HTLV-III 
genome (BH10) is shown in Figure 3, in which the 
position of sequences encoding gag protein p17 and 
the N-terminus of gag p24 and the C-terminus of gag 
p15 (which overlaps with the N-terminus of the pol 
protein) are indicated. The open reading frames 
(ORF) for po[, sor and env-lor are also indicated. The 
sequence of the remaining 182 base pairs of the 
HTLV-III DNA not present in clone BH10 (including a 
portion of R, U5, the tRNA primer binding site and a 
portion of the leader sequence) was derived from 
clone HXB2. The sequences of two additional clones 
(BH8 and BH5) are also shown. Restriction enzyme 
sites arc listed above the nucleotide sequence; sites 
present in clone BH8 but not in clone BH10 are in par- 
entheses. Deletions are noted ([]) at nucleotides 251, 
254, 5671 and 6987-7001. The nucleotide positions 
(to the right of each line) start with the transcriptional 
initiation site. The amino acid residues are numbered 
(to the right of each line) for the four largest open ren- 
ding frames starting after the preceding termination 
codon in each case except gag which is enumerated 
from the first methionine codon. A proposed peptide 
cleavage site (V) and possible asparagine-l inked gly- 
cosylate sites are shown (*) for the env-lor open 
rending frame. The sequences in the LTR derived 
from clones BH8 and bH10 listed in the beginning of 
the figure are derived from the 3 -portion of each 
clone and are assumed to be identical to those pres- 
ent in the 5'-LTRof the integrated copies of these viral 
genomes. 

Clone HXB2 was derived from a recombinant 
phage library of Xbal digested DNA from HTLV-III in- 
fected H9 cells cloned in lambdaJL H9 cells are hu- 
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man leukemic cells infected by a pool of HTLV-III from 
blood of AIDS patients, F. Wong-Staal, Nature, 312, 
November, 1984. Cloning vector clones BH10, BH8, 
and BH5 were derived from a library of Sstl digested 
DNAfrom the Hirt supernatant fraction ofHTLV-lll in- 5 
fected H9 cells cloned in lambdagtWes.lambdaB. 
Both libraries were screened with cDNA probe syn- 
thesized from virion DNA using oligo.dT as a primer. 
Clones BH8, BH5, and a portion of HXB2 were se- 
quenced as described by Maxam and Gilbert. (1980) 10 
Maxam, A. M. and Gilbert, Co. Methods in Enzymol- 
ogy. 65: 499-560. Clone BH10 was sequenced by the 
method of Sanger modified by the use of oligonucleo- 
tides complementary to the M13 insert sequence as 
primers and using Wenow fragment of DNA polymer- 15 
ase I or reverse transcriptase as the polymerase. 

Formation of RNA, RNA Probes and DNA Probes 
Specific to HTLV-III 

20 

DNA sequences which are an entire gene or seg- 
ment of a gene from HTLV-III are inserted into a vec- 
tor, such as a T7 vector. In this embodiment, the vec- 
tor has the Tceu promoter from the T cell gene 10 pro- 
moter and DNA sequences encoding eleven amino 25 
acids from the T cell gene 10 protein. 

The vectors are then used to transform cells, 
such as E. coli. The T7 vector makes use of the T7 
polymerase, which catalyzes RNa formation and rec- 
ognizes only T7 promoter, which is the site where 30 
RNA polymerase binds for the initiation of transcrip- 
tion. The T7 polymerase does not recognize E. coli 
promoter. As a result, if HTLV-III DNA sequences are 
inserted after the promoter and polymerase genes of 
the T7 vector, which recognizes them to the exclusion 35 
of the signals, and a terminator is placed immediately 
after the HTLV-III DNA sequences, the T7 vector will 
direct manufacture RNA complementary to the HTlV- 
III insert. 

Determination of the nucleotide sequence of 40 
HTLV-III DNA also provides the basis for the forma- 
tion of DNA probes. BothRNAproves and DNA HTLV- 
III probes must have a distinctive region of the HTLV- 
III genome in order to be useful in detecting HTLV-III 
in body fluids. There is relatively little homology be- 45 
tween the HTLV-III genome and the HTLV-I and -II 
genomes and probes contain regions which are 
unique to HTLV-III (i.e., not shared with HTLV-I or -II). 
For example, nucleotide sequences in the env gene 
region of HTLV-III can be used. so 

Either viral RNA or DNA can be used for detecting 
HTLV-III in, for example, saliva, which is known to 
have a very high concentration of the virus. This can 
be tone, for example, by means of a dot blot, in which 
the saliva sample is denatured, blotted onto paper 55 
and then screened using either type of probe. If saliva 
is used as the test fluid, detection of HTLV-III is con- 
siderable faster and easier than is the case if blood 



is tested. 

Production of Monoclonal Antibodies Reactive with 
HTLV-III Polypeptides 

Monoclonal antibodies reactive with HTLV-III 
polypeptides are produced by antibody-producing cell 
lines. The antibody-producing cell lines may be hy- 
bridoma cell lines commonly known as hybridomas. 
The hybrid cells are formed by fusion of cells which 
produce antibody to HTLV-III polypeptide and an im- 
mortalizing cell, that is, a cell which imparts long term 
tissue culture stability on the hybrid cell. In the forma- 
tion of the hydrid cell lines, the first fusion partner - 
the antibody-producing cell - can be a spleen ceil of 
an animal immunized against HTLV-III polypeptide. 
Alternatively, the antibody-producing cell can be iso- 
lated B lymphocyte which produces antibody against 
an HTLV-III antigen. The lymphocyte can be obtained 
from the spleen, peripheral blood, lymph nodes or 
other tissue. The second fusion partner - the immor- 
tal cell - can be a lymphoblastoid cell or a plasmacy- 
toma cell such as a myeloma cell, itself an antibody- 
producing cell but also malignant 

Murine hybridomas which produce monoclonal 
antibodies against HTLV-III polypeptide are formed 
by the fusion of mouse myeloma cells and spleen 
cells from mice immunized against the polypeptide. 
To immunize the mice, a variety of different immuni- 
zation protocols may be followed. For instance mice 
may receive primary and boosting immunizations of 
the purified polypeptide. The fusions are accomplish- 
ed by standard procedures: Kohler and Milstein, 
(1975) Nature (London) 256, 495-497; Kennet, R., 
(1980) in Monoclonal Antibodies (Kennet et aL, Eds. 
pp. 365-367, Plenum Press, NY). 

The hydridomas are then screened for production 
of antibody reactive with the polypeptide. This can be 
performed by screening procedures known in the art. 

Another way of forming the antibody-producing 
cell line is by transformation of antibody-producing 
cells. For example, a B lymphocyte obtained from an 
animal immunized against HTLV-III polypeptide may 
be infected and transformeed with a virus such as the 
Epstein-Barr virus in the case of human B lympho- 
cytes to give an immortal antibody-producing cell. 
See, e.g., Kozbor and Rodor (1983) Immunology To- 
day 4 (3), 72-79. Alternatively, the B lymphocyte may 
be transformed by a transforming gene or transform- 
ing gene product. 

The monoclonal antibodies against HTLV-III poly- 
peptide can be produced in large quantities by injec- 
tin, antibody-producing hybridomas into the perito- 
neal cavity of mice and, after an appropriate time, 
harvesting the ascites fluid which contains very high 
titer of homogenous antibody and isolating the mono- 
clonal antibodies therefrom. Xenogeneic hybridomas 
should be injected into irradiated or athymic nude 
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mice. Alternatively, the antibodies may be produced 
by culturing cells which produce HTLV-III polypeptide 
in vitro and isolating secreted monoclonal antibodies 
from the cell culture medium. The antibodies pro- 
duced according to these methods can be used in di- 
agnostic assays (e.g., detecting HTLV-III in body flu- 
ids) and in passive immunotherapy. The antibodies 
reactive with HTLV-III polypeptides provide the basis 
for diagnostic tests for the detection of AIDS or the 
presence of HTLV-III in biological fluids (e.g., blood, 
semen, saliva) and for passive immunotherapy. For 
example, it is possible to produce anti p 41, to attach 
it to a solid phase using conventional techniques and 
to contact the body fluid to be tested with the immo- 
bilized antibody. In this way, HTLV-III (antigen) can be 
detected in the body fluid; this method results in far 
fewer false positive test results than do tests, in which 
antibody against HTLV-III is detected. 

This invention will now be further illustrated by 
the following examples. 

EXAMPLE 1 

PREPARATION OF SONICATED DNA 
FRAGMENTS 

10 ug of gel purified HTLV-III restriction frag- 
ments were sonicated to fragment size on average of 
500 bps. After sonication, the DNA was passed 
through a DEAE-cellulose column in 0.1XTBE in or- 
der to reduce the volume. The DEAE-bound DNA was 
washed with 5 ml of 0.2 M NaCI-TE (2 M NaCI, 10 mm 
Tris HCI pH 7.5, 1 mM EDTA) and then eluted with 1 
M NaCI-TE, and ethanol precipitated. The size range 
of the sonicated DNA was then determined on 1.2% 
agarose gel. DNA fragments of desired length (200- 
500 bps) was eluted from the gel. T4 DNA polymerase 
was used to fill in and/or trim the single strand DNA 
termini generated by the sonication procedure. DNA 
fragments were incubated with T4 polymerase in the 
absence of added nucleotides forf ive minutes at 37°C 
to remove nucleotides from 3' end and then all 4 nu- 
cleotide precursors were added to a final concentra- 
tion of 1 00 uM and the reaction mixture was incubated 
another 30 minutes to repair the 5'-end single strand- 
ed overhang. The reaction was stopped by heat inac- 
tivation of the enzyme at 68°C for 10 minutes. DNA 
was phenol extracted once, ethanol precipitated and 
resuspended in TE. 

EXAMPLE 2 

CLONING OF RANDOM SHEARED DNA 
FRAGMENTS 

The sonicated blunt end repaired HTLV-III DNA 
fragments were ligated into the Smal site of the ORF 
expression vector pMR100 and transformed into host 



cell LG90 using standard transformation procedures. 
B-galactosidase positive phenotype of the transfor- 
mant were identified by plating the transformed cell 
on ampicillin (25 ug/ml) containing McConkey agar 
5 plates and scoring the phenotype after 20 hours at 
37°C. 

EXAMPLE 3 

10 HYBRID PROTEIN ANALYSIS 

Ten milliliter samples of cells from an overnight 
saturated culture grown in L broth containing ampicil- 
lin (25 ug/ml) were centrifuged, the cell pellet was re- 

15 suspended in 500 ut of 1 .2 fold concentrated Laemmli 
sample buffer. The cells were resuspended by vortex- 
ing and boiling for 3 minutes at 1 00°C. The lysate was 
then repented by being forced through a 22 gunge 
needle to reduce the lysate viscosity. Approximately 

20 1 0 ul of the protein samples were electrophoresed in 
7.5% SDS-PAGE (SDS-polyacrylamide) gels. 

Electro phoretic transfer of proteins from SDS-PA- 
GE gels to nitrocellulose paper was carried out ac- 
cording to Towbin et aL After the transfer, the filter 

25 was incubated at 37°C for two hours in a solution of 
5% (w/v) nonfat milk in PBS containing 0.1% antifoam 
A and 0.0001% merthiolate to saturate all available 
protein binding sites. Reactions with AIDS antisera 
were carried out in the same milk buffer containing 

30 1 % AIDS patient antisera that had been preabsorbed 
with E. coli lysate. Reactions were performed in a 
sealed plastic bag at 4°C for 18-24 hours on a rotary 
shaker. Following this incubation, the filter was wash- 
ed three times for 20 minutes each at room tempera- 

35 ture in a solution containing 0.5% deoxycholic, 0.1 M 
NaCI, 0.5% triton X-100; 1 0 mm phosphate buffer pH 
7.5 and 0.1 mM PMSF. 

To visualize antigen-antibody interactions, the ni- 
trocellulose was then incubated with the second goat 

40 antihuman antibody that had been iodinated with 125 l. 
The reaction with the iodinated antibody was carried 
out at room temperature for 10 minutes in the came 
milk buffer as was used for the first antibody. The ni- 
trocellulose was then washed as previously descri- 

45 bed and exposed at -70°C using Kodak XAR5 film 
with an intensifying screen. 

EXAMPLE 4 

50 SCREENING OF THE HTLV-III ORF LIBRARY BY 
COLONY HYBRIDIZATION 

E. coM LG90 transformants were screened with 
HTLV-III DNA probes containing the DNA regions of 
55 interest(e.g. HTLV-III gag, env or Px gene specific se- 
quences ). Colonies were grown on nitrocellulose fil- 
ter and screened according to the procedure of Grun- 
steir and Hogness by using a nick-translated HTLV-III 
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DNA as hybridization probe. 

The DNA fragment was in general excised by re- 
striction endonuclease digestion, gel purified, and 
32 p-labeled to a specific activity of 0.5x10 s cpm/ug by 
nick-translation (Rigby, P.W.J, et aL, J.Mol. Biol. 113 , 5 
237 (1977). Duplicate nitrocellulose filters with DNA 
fixed to them were prehybridized with 6xSSC (0.9 M 
NaCI/0.09 M sodium citrate, pH 7.0), 5X Denhardt's 
solution (Denhardfs solution: 0.02% each of polyvi- 
nylpyrrolidone, Ficoll and bovine serum albumin) 10 10 
ug of denatured sonicated E. coli DNA per ml at 55°C 
for 3-5 hours. The filters were then placed in a fresh 
sample of the same solution to which the denatured 
hydridization probe had been added. Hydridization 
was permitted to take place at 68°C for 1 6 hours. The is 
filters were washed repeatedly in 0.3XSSC at 55°C, 
and then 

EXAMPLE 5 

20 

RECOMBINANT DNA PRODUCED PEPTIDE OF 
HTLV-III WHICH IS IMMUNOREACTIVE WITH 
SERA FROM PATIENTS WITH AIDS 

An expression vector, pIN-MI-ompA (ompa) was 25 
used. ompA has the lipoprotein (the most abundant 
protein in E.coli) gene promoter ( Ipp) and the lacUV5 
promoter-operator (Figure 1 ). ompA vectors also 
contain the DNA segment encoding the lac repressor, 
which allows the expression of the inserted DNA to be 30 
regulated by lac operon inducers such as IPTG. The 
ompA cloning vehicles contain three unique restric- 
tion enzyme sites Ecorl, Hindlll, Bam HI in all three 
reading frames and permit the insertion of DNA into 
and of these restriction sites. 35 

Various restriction fragments were excised from 
the recombinant clone, lambdaBHIO, which contains 
a 9 Kb long HTLV-III DNA insert in the Sstl site of the 
vector lambdagtWES lambdaB. These restrictions 
fragments were them inserted into the ompA vectors 40 
at all three reading frames and used to transform E. 
coli JA221 cells. Transformants were first screened 
for HTLV-III DNA by in situ colony hydridization using 
nick-translated HTLV-III DNA probes. The positive 
clones were then screened for expression of HTLV-III 45 
antigenic peptides using HTLV-ill specific antibodies. 
For this, lysates of E. coli cell containing HTLV-III DNA 
recombinant plasmids were electrophoresis on 12.5% 
SDS-polyacryiamide gel and electroblotted onto ni- 
trocellulose filters. The filters were then incubated 50 
first with well-characterized sera from AIDS patients 
and next with 125 l-labelled goat anti-human IgG anti- 
bodies. The washed filters were autoradiographed to 
identify peptides reactive with anti-HTLV-lll antibo- 
dies. 55 

Several gene segments that encode-peptides 
showing immunoreacuvity with anti-HTLV-lll antibo- 
dies were demonstrated. Among these is a 1.1 Kb 



EcoRI restriction fragment. This fragment was insert- 
ed into ompA vectors in all three reading frames (Fig- 
ure 5). Cells were grown at 37°C in L broth containing 
100mg/ml. ampicillin to an OD 600 of 0.2. At this time, 
the cell cultures were divided into two aliquots. IPTG 
was added to one aliquot to a final concentration of 2 
mM (induced). ITPG was not added to the other ali- 
quot (uninduced). Upon ITPG induction, transfor- 
mants of all three pi as mid constructs (designated 
OmpArR-6, (01 R6) OmpA r R-7 (02R7), and 
OmpA3-R-3 (03R3)) produced a 1 5 Kd peptide that is 
strongly reactive with anti-HTLV-lll antibodies in sera 
from AIDS patients (Figure 6 lane 1, purified HTLV-III 
virions; lanes 2 and 3, 01 R6 uninduced and induced; 
lanes 4 and 5, 02R7 uninduced and induced; lanes 6 
and 7 03R3 uninduced and induced). This reactivity 
is not detected when sera from normal individuals is 
used. 

DNA sequence data of the HTLV-II genome indi- 
cates that there is an open reading frame inside the 
pol gene located at the 5'-end of the EcoRI fragment. 
DNA sequence analysis of the three recombinant 
constructs, 01 R6, 02R7 and P3R3, confirmed that 
each of these recombinants has a different reading 
frame of the HTLV-III plus strand coupled to the cod- 
ing sequence of each vector. Only in 03R3 is the 
reading frame of the inserted DNA in phase with that 
set by the signal peptide in the ompA vector; in 01 R6 
and 02R7 the pol gene segment DNA is out of phase 
(Figure 6a), 

There is a 6 bp ribosome binding site, AAGGAG 
(Shine-Dalgarno sequence), located at nucleotide 
position 24-29 and an initiation codon, ATG, located 
11 bp downstream (position 41-43). The 15 Kd pep- 
tide synthesized by all three recombinants appears to 
be translated from the transcript using this internal 
codon. If this is true, the peptide starts from the ATG 
located at position 41-43 and ends at the stop codon 
at position 446-448, producing a peptide of 135 amino 
acid residue encoded by the 3'-end segment of the 
pol gene of HTLV-III. 

In addition to the 15 Kd peptide, the 03R3 con- 
struct, in which the reading frame of the HTLV-III DNA 
pol gene is in phase with that set by the vector, pro- 
duced two additional peptides about 19 Kd and 16.5 
Kd in size (Figure 6). It is possible that the 19 Kd pep- 
tide contains an additional 35 amino acid residues, 21 
of which are from the signal peptide encoded by the 
ompA 3 vector and 14 encoded by the inserted HTLV- 
III DNA itself. The 16.5 Kd peptide may be the proc- 
essed 19 Kd peptide in which the signal peptide is 
cleaved. 

The 01R6 and 02 R7 constructs also produces 
another peptide of about 17.5 Kd (Figure 6) and weak- 
ly reactive with sera of AIDS patients. The origin of 
this peptide is not clear. The 1.1 Kb EcoRI fragment 
contains a second potential coding region designated 
as the short open reading frame (SOR) extending 
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from nucleotide position 260 to 965 (Figure 5). Four 
of the five AUG methionine codons in this region are 
near the 5 -end of this open reading frame. This DNA 
segment could encode peptides of 192, 185, 177 or 
1 64 amino acid residues. However, there is no clearly 5 
recognizable ribosome binding site at the 5'-end of 
this open reading frame. 

Further evidence also supports the conclusion 
that the 15 Kd peptide is indeed derived from the po[ 
gene. First, deletion of the 3'-end Stul to EcoRI frag- 10 
ment from the 1.1 Kb EcoRI insert from 01r6, 02R7 
and 03R8 (Figure 5) does not affect the synthesis of 
the 1 5 Kd peptide. Second, clones containing only the 
5'-end EcoRI to Ndel fragment still produce the same 
15 Kd peptide. Finally, several recombinant clones 15 
containing various DNA fragments having the SOR 
coding sequence properly inserted into the open 
reading frame cloning vector, pMR100, produced 
lambdaCI-HTLV-lll B-galactosidase tripartite fusion 
proteins which have very little immunoreactivity with 20 
anti-HTLV-lll antibodies present in sera from AIDS pa- 
tients. 

Significant immunoreactivity against the 15 Kd 
peptide derived from the viral po[ gene in sera from 
AIDS patients was detected. The identify of this im- 25 
munoreactive peptide, with respect to the banding 
pattern of HTLV-III virion antigen in SDS-polyacryla- 
mide gel electrophoresis, was determined by means 
of a competition inhibition immunoassay. Purified 
HTLV-III virions were treated with SDS, electrophor- 30 
esis, and electroblotted onto a nitrocellulose filter. 
Identical filter strips containing disrupted HTLV-III vir- 
ions were incubated with well characterized serum 
from an AIDS patient in the presence or absence of 
lysates of 01R6, 02R7, or control bacterial clones. 35 
The specific immunoreaction between anti-HTLV-lll 
antibodies present in sera of the AIDS patients and 
the blotted virion proteins were then revealed by 125 l- 
labeled goat anti-human antibody. As shown in Figure 
7, lysates of 01 R6 block the immunoreactivity of the 40 
viral p31 protein with the AIDS serum, while lysates 
of control cells do not. This result suggest that the re- 
combinant 15 Kd peptide encoded by 3'-end of the vi- 
ral pol gene is also a part of another virion protein, 
p31 , in contrast to the view shared by some that p31 45 
is a cellular protein which co-purifies with HTLV-III vir- 
ions. 

The prevalence in the sera of AIDS patients of an- 
tibodies against the 15 Kd peptide was also evaluat- 
ed. In Western blot analysis employing the lysate of 50 
01 R6 as the source of antigen, a panel of coded sera 
from AIDS patients and normal healthy individuals 
was tested. All of the 20 AIDS sera and none of the 8 
norral control reacted with the 15 Kd peptide. Repre- 
sentative results are shown in (Figure 8). These data 55 
indicate that most, rf not all, AIDS patents produce an- 
tibodies against the viral p31 protein. 



EXAMPLE 6 

EXPRESSION IN E. COLI OF OPEN READING 
FRAME GENE SEGMENTS OF HTLV-III 

HTLV-III DNA was excised from lambda BH-10, 
which is previously construed recombinant lambda 
phage containing a 9 Kb segment of HTLV-III DNA in- 
serted into the vector lambdagtwes lambda B (Figure 
2a). This HTLV-III DNA was sonicated and DNA frag- 
ments of about 0.5 Kb purified by gel electrophoresis, 
end repaired, and inserted into the Smal site of the 
open reading frame (ORF) vector, pMR1 00 (Figure 9). 
This vector contains a bacterial lac promoter DNA 
segment linked to a second DNA fragment containing 
a hybrid coding sequence in which the N-terminus 
(5' segment) of the lambda CI gene of bacteriophage 
lambda is fused to an N-terminal-deleted laclZ gene 
(3'-segment). A short linker DNA fragment containing 
a Smal cloning site, has been inserted between these 
two fragments in such a manner that a frame shift mu- 
tation has been introduced upstream of the laclZ-cod- 
ing DNA. As a result, pMR100 does not produce any 
detectable B-galactosidase activity when introduced 
into cells of the Lac" host E. coli LG90. The insertion 
of foreign DNA containing an open reading frame, in 
this case the HTLV-III DNA, at the Smal cloning site 
can reverse the frame shift mutation if the inserted 
coding sequence is in the correct reading frame with 
respect to both the lambdaCI leader and the LaclZ 
gene. Transformants were screened on MacConkey 
plates to detect individual clones that expressed B- 
galactosidase enzymatic activity in situ . 

Among the 6000 ampicillin resistant transfor- 
mants screened, about 300 were found to express B- 
galactosidase activity. Colony hybridization using Re- 
labelled nick-translated HTLV-III DNA as a probe re- 
vealed that all these Lac + clones contained HTLV-III 
DNA. In the Lac + clones the HTLV-III fragment insert- 
ed into the Sma I site of pMR1 00 must contain no stop 
codons in the reading frame set by the lambdaCI 
leader segment and the laclZ gene must also be in 
the correct translation^ reading frame. The three- 
element-fused genes were expressed as tripartite fu- 
sion proteins, having a portion of the lambdaCI pro- 
tein at the N-terminus, the HTLV-III segment in the 
middle, and the laclZ polypeptide at the C-terminus. 

THe proteins produced by the Lac* clones were 
analyzed by resolving cell lysates on 7.5% SDS-poly- 
acrylamide gels along with those of the control Lac + 
clone pMR200, which produced a lambdaCI-B-galac- 
tosidase fusion protein. The laclZ gene in pMR200 is 
identical to that in pMR100 except that it has a single 
base pair deletion which brings it in phase with the 
lambdaCI gene to produce an active B-galactosidase. 
By virtue of the very large size of the B-galactosidase 
and its fusion proteins, they are separated from the 
bulk of proteins in the cell lysates on the SDS-polya- 
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crylamide gels and can be easily Identified by Coo- 
massie brilliant blue staining as shown in Figure 10a. 
Some of the Lac + clones containing HTLV-III DNA pro- 
duce polypeptides that are larger (15,000 to 27,000 
dalton) than the lambdaCI-laclZ fusion protein. The s 
findings are consistent with data thatthe DNA inserts 
are up to 700 bp long. The B-gaJactosidase fusion pro- 
teins accounted for about 1-2% of total cellular pro- 
tein. 

The peptides produced by the Lac + clones were 10 
examined by Western blot analysis for immunoreac- 
tivity with sera from AIDS patients. After the lysates 
of Lac* clones were electrophoresed in SDS-polya- 
cryl amide gels, they were electro-transferred to nitro- 
cellulose filters. These protein blots were first reacted 15 
with AIDS patient sera and then with 125 l-labeled goat 
anti-human IgG. The autoradiograph in Figure 10b 
shows the immunoreactivity of a representative fused 
protein with the serum from an AIDS patient. The re- 
combinant peptides also reacted with anti-B- 20 
galactosidase antiserum, consistent with the propo- 
sition that they had the general structure lambdaCI- 
HTLV-III peptide-LaclZ. From the immunoreactivity 
pattern of the negative controls, pMR100 and 
pMR200, which do not contain an HTLV-III DNA in- 25 
sert, it is evident that this particular AIDS serum con- 
tains antibodies reactive with several bacterial pro- 
teins of the host E coli. This is not surprising, since 
AIDS patients are usually infected with a number of 
bacteria. Absorbing AIDS patient sera with Sephar- 30 
ose 4B conjugated with E col[ extract reduced the 
background immunoreactivity to some extend but did 
not completely eliminate it. 

About 300 independent HTLV-III DNA-containing 
Lac + colonies were analyzed in SDS polyacrylamide 35 
gels using Coomassie brilliant blue staining and West- 
erns blotting. About half of them were found to ex- 
press fusion proteins containing extra peptides of 
about 100-200 amino acids, corresponding to DNA in- 
serts of 300-600 bp long. Of these fusion proteins, 20 40 
were found to react specifically with sera from AIDS 
patients. The unreactive clones probably contain pep- 
tides that fold in such a way that they are not reactive 
with antibodies or correspond to regions of HTLV-III 
protein molecules which are not immunogenic in AIDS 45 
patients. The other half of the Lac + clones expressed 
fusion proteins whose sizes were not obviously differ- 
ent from that of the lambdaCI B-galactosidase pro- 
tein. None from this group of fusion proteins was 
found to react with sera from AIDS patients. so 

THe HTLV-III DNA inserts from Lac + ORF clones 
were mapped to specific segments in the HTLV-III 
genome using Southern blotting procedures. In these 
studies, each plasmid clone was labelled with ^p by 
nick-translation and hydribized to a battery of HTLV- 55 
III DNA restriction fragments. This hybridization ana- 
lysis mapped all of the Lac + ORF clones into four open 
reading frame segments designated ORF- A, ORF-B, 



ORF-C, and ORF-D (Figure 2a) consistent with the 
DNA sequencing data. The open reading frames 
ORF-A and -B, corresponding to the coding regions 
of the gag and pol genes, are 1 .5 Kb and 3.0 Kb long, 
respectively. ORF-C is about 0.6 Kb long, slightly 
overlaps with the ORF-B region, and is capable of en- 
coding a polypeptide of 21 overlaps with the ORF-B 
region, and is capable of encoding a polypeptide of 21 
Kd. The location of ORF-C and its overlap with the pol 
gene are reminiscent of the structure of the env 
genes in HTLV-I and -II. However, ORF-C, designates 
as the short open reading frame (sor), is too short to 
code for the entire envelope protein. The fourth open 
reading frame, ORF-D is 2.5 Kb long and could en- 
code both a large precursor of the major envelope 
glycoprotein and another protein derived from the 
3' terminus, which may be analogous to the lor prod- 
ucts of HTLV-I and -II. this gene region of HTLV-III, 
designated env-lor , is at least twice as long as the lor 
of HTLV-I and HTLV-II and it is presently unclear 
whether single or multiple proteins are encoded here- 
in. 

Both Southern blotting and DNA sequencing 
studies were employed to analyze a number of 
clones. As shown in Figure 2b, the Lac + ORF clones 
expressing fusion proteins immunoreactive with sera 
from AIDS patients were located in ORF-A (e.g. #175 
and #191), ORF-B (e.g. #1 3, 31 , and 1 62), or ORF-D 
(e.g. #113, 121, and 127) and not in the sor region. 
Not all peptides in these regions were immunoreac- 
tive, e.g. ORF clone #76 located in ORF-D. 

Analysis of the open reading frame structures in 
HTLV-III posed questions as to which open reading 
frames(s) corresponds to the env gene. It is possible 
that the env-lor region in HTLV-III contains all or a 
part of the env gene in addition to the presumed lor 
gene. Recent evidence suggests that the lor in HTLV- 
I encodes a 42 Kd protein involved in the process of 
viral activation and transformation. When the lysate 
of one of the ORF clones (#127 in Figure 2b) was test- 
ed against sera from 20 AIDS patients and 12 healthy 
normals in a strip radioimmunoassay based on the 
Western blot technique, immunoreactivity against the 
lambdaCI-HTLV-lll-B-galactasidase fusion polypep- 
tide was detected in the sera from 19 of the AIDS pa- 
tients and none from normal controls. This result in- 
dicates thatthe protein encoded by the portion of the 
env-lor region contained in ORF clone # 127 is pro- 
duced in HTLV-III infected cells and induces antibody 
production in most if no all AIDS patients. 

Industrial Applicability 

This invention has industrial applicability in 
screening for the presence of HTLV-III DNA in body 
fluids and the diagnosis of AIDS. 
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Equivalents 

Those skilled in the art will recognize, or be able 
to ascertain, using no more that routine experimenta- 
tion, numerous equivalents to the specific substanc- 
es and procedures described herein. Such equiva- 
lents are considered to be within the scope of this in- 
vention and are covered by the following claims. 

The invention therefore provides an immunoreac- 
tive HTLV-III polypeptide expressed by cells trans- 
formed with a recombinant vector containing HTLV-III 
cDNA The HTLV-III cDNAmay encode an envgene 
sequence, an env-lor gene sequence or be an EcoRI 
restriction fragment, and/or be immunoreactive with 
sera of patients with acquired immunodeficiency syn- 
drome or sera containing antibodies to HTLV-III. 

The invention also provides; (a) isolated HTLV-III 
envelope polypeptide; or (b) isolated HTLV-III poly- 
peptide encoded by an env-lor gene sequence; or (c) 
isolated cDNA encoding an HTLV-III gene, which may 
encode the HTLV-III envgene, or the HTLV-III env-lor 
gene sequence. The cDNA may also encode an 
EcoRI restriction fragment of HTLV-III cDNA which en- 
codes a polypeptide immunoreactive with sera of pa- 
tients with acquired immunodeficiency syndrome or 
sera containing antibodies to HTLV-III, or an HTLV-III 
polypeptide which is immunoreactive. 

Preferably, the isolated cDNA encodes an envel- 
ope polypeptide which is immunoreactive with sera of 
patients with acquired immunodeficiency syndrome 
or sera containing antibodies to HTLV-III, and may be 
an EcoRI restriction fragment. 

The invention also provides a DNA probe com- 
prising a DNA sequence which is essentially homolo- 
gous to a portion of the HTLV-III genome unique to 
the virus. The probe may comprise a DNA sequence 
which is essentially homologous to a portion of the 
HTLV-III genome which encodes a polypeptide immu- 
noreactive with sera of AIDS patients. 

Also covered by the invention is a hybrid protein 
comprising an HTLV-III polypeptide linked to at least 
one other polypeptide, e.g. an indicator polypeptide 
which may comprise beta-gal acto-s id ase, and an iso- 
lated RNA transcript of the env gene of HTLV-III, 
which preferably has a label which emits a detectable 
signal, e.g. a radioisotope. 

The invention also provides a recombinant vector 
containing HTLV-III DNA capable of expression upon 
insertion into host cells; or an OmpA vector containing 
HTLV-III cDNA; ora pMR 100 vector containing HTLV- 
III cDNA. 

The invention also provides a method of produc- 
ing HTLV-III polypeptide, comprising the steps of: 

(a) cleaving HTLv-lll cDNAto produce cDNA frag- 
ments; 

(b) inserting the cDNA fragments into an expres- 
sion vector to form a recombinant vector; 

(c) transforming an appropriate host cell with the 



recombinant vector; and 

(d) culturing the transformed host cell under con- 
ditions sufficient for expression of the polypep- 
tide encoded for by the inserted HTLV-III cDNA. 

5 Preferably, the cleaving step comprises: 

(a) digesting the HTLV-III cDNA with restric- 
tion endonucleases to produce restriction 
fragments of cDNA, or 

(b) shearing the HTLV-III cDNA to produce 
10 cDNA fragments. 

Also provided is a method of producing HTLV-III 
envelope polypeptide, comprising the steps of: 
(a) cleaving HTLV-III genomic cDNA with the re- 
striction endonuclease Sstl; 
15 (b) digesting the cleaved cDNA with restriction 

endonucleases sufficient to generate restriction 
fragments which encompass at least a portion of 
the env gene; 

(c) isolating the restriction fragments; 
20 (d) producing DNA fragments of about 200-500 

base pairs in length from the restriction frag- 
ments; 

(e) isolating the DNA fragments of about 200-500 
base pairs; 

25 (f) inserting the isolated fragments into the open 

reading frame expression vector pMR1 00 for pro- 
duction of hybrid proteins comprising an env gene 
product and beta-galactosidase; 
(g) transforming lac 2"E.coli cells with the vecton 

30 (h) plating the transformed cells on MacConkey 

agar plates, maintaining the plates under condi- 
tions sufficient for the formation of colonies and 
selecting cell colonies exhibiting a red color; 
(i) culturing transformed cells from the selected 

35 colonies under conditions which allow expression 

of the hybrid protein; 

(j) obtaining cellular protein from the cultured 
transformed cells; 

(k) separating the cellular protein obtained; 

40 (I) contacting the separated protein with sera 
from AIDS patients to identify protein which is im- 
munoreactive with the sera; 
(m) isolating the immunoreactive protein; and 
(n) optionally separating the env gene expression 

45 product from the remainder of the hybrid protein. 

The invention also provides an antibody (e.g. 
monoclonal antibody) which is; 

(a) specifically reactive with HTLV-III envelope 
polypeptide; or 

so (b) specifically reactive with HTLV-III polypeptide 

produced by recombinant DNA techniques. 
Also provided is an immunoassay for the detec- 
tion of HTLV-III employing antibody (e.g. monoclonal 
antibody) 

55 (a) which reacts specifically with HTLV-III poly- 

peptide produced by recombinant DNA techni- 
ques; or which 

(b) reacts specifically with HTLV-III envelope 
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polypeptide. 

The invention also provides a sandwich type im- 
munoradiometric assay for the detection of HTLV-III 
employing an immobilized antibody which reacts with 
HTLV-III polypeptide and a soluble antibody which re- 
acts with HTLV-III polypeptide, and an assay kit com- 
prising an antibody which reacts specifically with 
HTLV-III polypeptide bound to a solid phase and a lab- 
eled soluble antibody which reacts specifically with 
HTLV-III polypeptide. 

The invention also provides a method of detect- 
ing antibodies against HTLV-III in a bodily fluid com- 
prising the steps of: 

(a) contacting an immunoadsorbent comprising 
an HTLV-III polypeptide bound to a solid phase 
with a bodily fluid until antibodies against HTLV- 
III polypeptide in the bodily fluid bind the solid 
phase polypeptide; 

(b) separating the immunoadsorbent from the 
bodily fluid; 

(c) contacting the immunoadsorbent with a lab- 
eled HTLV-III polypeptide or labeled antibody 
against human immunoglobulin; and 

(d) determining the amount of labeled polypep- 
tide bound to immunoadsorbent as an indication 
of antibody to HTLV-III. 

Also provided is a kit for determining the pres- 
ence of antibody against HTLV-III in a bodily fluid 
comprising: 

(a) an immunoadsorbent comprising a HTLV-III 
polypeptide bound to a solid phase; and 

(b) labeled HTLV-III polypeptide or a labeled an- 
tibody against human immunoglobulin. 

The invention also provides a method of detect- 



base pairs, and/or (b) is a restriction fragment 
produced by digestion at any of the sites shown 
in Figs 1 and 2, and/or (c) has the sequence 
shown in Fig. 3. 

5 

3. Polypeptide according to claim 1 or claim 2, 
wherein (a) the DNA encodes an env polypeptide 
and the DNA sequence extends from approxi- 
mately nucleotide 5780 to approximately nucleo- 
id tide 8370 of the HTLV-III DNA as shown in Fig. 3, 
or a fragment thereof; or (b) the DNA encodes a 
gag polypeptide (p17) and the DNAsequence ex- 
tends from approximately nucleotide 335 to ap- 
proximately nucleotide 730 of the HTLV-III DNA 
15 as shown in Fig. 3, or a fragment thereof; or (c) 
the DNA encodes a gag polypeptide (p24) and 
the DNA sequence extends from approximately 
nucleotide 730 to approximately nucleotide 1630 
of the HTLV-III DNA as shown in Fig. 3, or a frag- 
20 ment thereof. 

. 4. An immunoassay for the detection of antibodies 
to HTLV-III, comprising the steps of contacting 
polypeptide according to any one of the preced- 
25 ing claims with a test sample, and then determin- 

ing the presence of antibody bound thereto. 

5. An isolated DNA (e.g. a recombinant vector) com- 
prising a DNA sequence encoding polypeptide 

30 according to any one of claims 1 to 3; or a host 

cell transformed with said DNA 

6. A method of producing HTLV-III polypeptide, 
comprising the step of culturing the host cell of 
claim 5 under conditions sufficient for expression 
of the polypeptide. 

7. A method according to claim 6, wherein the 
HTLV-III polypeptide is an envelope polypeptide 
and the HTLV-III DNA is cleaved with restriction 
endonucleases to generate fragments encom- 
passing at least part of the env gene, comprising 
the steps of; 

(a) isolating the restriction fragments, 

(b) producing and then isolating fragments of 
about 200 to 500 base pairs in length from the 
restriction fragments, 

(c) inserting the isolated fragments into the 
open reading frame expression vector 
pMR100 for production of hybrid proteins 
comprising env gene product and beta- 
galactosidase, 

(d) transforming lac Z~ E. coli cells with the 
vector, 

(e) plating the transformed cells on MacCon- 
key agar plates, maintaining the plates under 
conditions sufficient for the formation of col- 
onies and selecting cell colonies exhibiting a 



ing HTLV-III nucleic acid in a bodily fluid (e.g. a cell 35 
lysate) comprising the steps of: 

(a) adsorbing the nucleic acid in a bodily fluid 
onto an adsorbent; 

(b) denaturing the adsorbed nucleic acid; 

(c) contacting the adsorbed nucleic acid with an 40 
HTLV-III DNA or RNA probe; and 

(d) determining if the probe hybridizes with the 
adsorbed nucleic acid, and a hybridoma cell line 
which produces antibody specifically reactive 
with HTLV-III envelope polypeptide. 45 



Claims 

1. HTLV-III polypeptide expressed by cells trans- so 
formed with a recombinant vector containing 
DNA encoding a gag , env, env-lor sor and/or pol 
polypeptide, or an immunologically functional 
equivalent thereof, the polypeptide being immu- 
noreactive with HTLV-lll-specific antibody. 55 

2. Polypeptide according to claim 1, wherein the 
DNA; (a) is a fragment of approximately 200-500 
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red colour, 

(f) culturing transformed cells from the select- 
ed colonies under conditions which allow ex- 
pression of the hybrid protein, 

(g) obtaining cellular protein from the cultured 5 
transformed cells, 

(h) separating the cellular protein obtained, 

(i) identifying protein which is immunoreactive 
with the sera, 

(j) isolating the immunoreactive protein, and w 
optionally 

(k) separating the env gene expression prod- 
uct from the remainder of the hybrid protein. 

8. A hybrid protein comprising polypeptide accord- 15 
ing to any one of claims 1 to 3, linked to a poly- 
peptide that is not immunoreactive with antibo- 
dies to HTLV-III. 

9. A DNA probe or an isolated RNA transcript com- 20 
prising a nucleic acid sequence essentially hom- 
ologous to the DNA as defined in claim 5. 

10. Monoclonal antibody specifically reactive with 

the polypeptide according to any one of claims 1 25 
to 3; or a hybridoma cell line which produces said 
antibody. 

11. An immunoassay for the detection of HTLV-III 
employing antibody according to claim 10. 30 

12. An assay kit comprising a solid phase bound an- 
tibody which reacts specifically with HTLV-III 
polypeptide and a labeled soluble antibody which 
reacts specifically with HTLV-III polypeptide, 35 
wherein the polypeptide is according to any one 

of claims 1 to 3. 

13. A method of detecting antibodies against HTLV- 
III in a bodily fluid, comprising the steps of; 40 

(a) contacting an immunoadsorbent compris- 
ing an HTLV-III polypeptide bound to a solid 
phase with a bodily fluid until antibodies 
against HTLV-III polypeptide in the bodily fluid 

bind the solid phase polypeptide, wherein 45 
said polypeptide is according to any one of 
claims 1 to 3, 

(b) separating the immunoadsorbent from the 
bodily fluid, 

(c) contacting the immunoadsorbent with the so 
labeled HTLV-III polypeptide or labeled anti- 
body against human immunoglobulin, and 

(d) determining the amount of labeled poly- 
peptide bound to immunoadsorbent as an in- 
dication of antibody to HTLV-III. 55 

14. An assay kit for determining the presence of an- 
tibody against HTLV-III in a bodily fluid, compris- 



ing; 

(a) an immunoadsorbent comprising a 
HTLV-III polypeptide bound to a solid phase, 
wherein said polypeptide is according to any one 
of claims 1 to 3, and (b) a labeled HTLV-III poly- 
peptide or a labeled antibody against human im- 
munoglobulin. 

15. A method of detecting HTLV-III nucleic acid in a 
bodily fluid (e.g. a cell lysate) comprising the 
steps of; 

(a) adsorbing the nucleic acid in a bodily fluid 
onto an adsorbent, 

(b) denaturing the adsorbed nucleic acid, 

(c) contacting the adsorbed nucleic acid with 
HTLV-III DNAor RNA probe according to claim 
9, and 

(d) determining if the probe hybridizes with 
the adsorbed nucleic acid. 

16. A diagnostic reagent comprising a polypeptide 
according to any one of claims 1 to 3, which is for 
example labeled and e.g. further comprises a sol- 
id phase on which the polypeptide is immobilized. 

1 7. A polypeptide according to any one of claims 1 to 
3, for use in therapy, e.g. vaccination. 

18. Use of a monoclonal antibody according to claim 
10 for the manufacture of a medicament for use 
in immunotherapy against AIDS. 

19. Use of a polypeptide according to any one of 
claims 1 to 3 for the manufacture of a medicament 
for use in vaccination against AIDS. 
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C10NC wcicoim AMINO 

POSITION ACII 

■tsioue 

Mil TOCAAOOOCTAATTCACTCCCAACOAAOACAAOA -420 

wu - 

(Im HI) 

IN II TATCCTTOATCTOTGOATCTACCACACACAAOCCT&CTTCCCTCUTTAOCAOAACTACACACCiOOOCCAOOOAT -143 
INI -C 0 AO- 

•JHI Cf^I^CCAXT^TTT^W -271 

IN II * OCA OA OA A CACC AOCT TOT T AC ACCCTGTGA OCCTOC AT OOAAT OOA TOACCCOOA0A6 AOAACTOT T AOA OT 0 -IM 
INI T 

•Nil <MGOTTT(MCA0CCOCCTA0CATTTCATCACAT0OCCC0AGAXT0CATCC(M0TACTTCAA0AACT0CTCACA - 121 



IN 11 TCGAKTTOCTACAAOGOACTTTCCQCTGOOGACTTTCCAGOOAGOCGTOOCCTOGOCOOOACTOGGOAOTGOCO -AS 
TATA 

IOX fvw II US— 1 
IN II ACCCCTCACA TCC TOCA TAT AAOCAOCT OCT T T T TOCCT 0 T AC T -I 



« Iql II Stt I 

1 10 OOCTC TCTC TOOT T AGACCAGA 



•JMO GGGTC TCTC TGGT T AGACCAGA TCTOAGCC TGGGAGCTC St 

NXM TCTOOCTAACTAOOOAACCCACTOCTTAAOCCTCAA 75 

Mind III ■ 1- US 

NXI2 TAAA- CTTOCCTTOAOTOCTTCAAGTAOTOTCTGCCCOTCTOTTOTOTOACTCTOCTAACTAOAOATCCCTCAOA 151 

«5^1 - UNA-lytlf»# 1 U#d«r *»*#«nC4 

II 

NXIZ CCCT T rTAOTCAOTOTOOAAAATCTCTAOCAGTOOCOCCCCAACAOOGACCTOAAAOCOAAAOCOAAACCA It 1 

IN 10 GACCTCTCTC6AC0CAOOACTCOOCTTGCT0AA0CGCOCACGGCAAOAGCCGAOGGOCGGCOACT0GT0AGTAC0 294 

•N5 - - - 

Ul4ar MOu«n C | 1 OAC #17 

IN 10 CCAAA AA T T T T OA CT AOCGGAGOC T AOA AOOAO A OA OA T CGOT GCGAG AQCG TC AG T ATT AAOCOOCOOACAA T T 17 f 

m H»tGIyAl»ArgAUS«rV*ll«wS«r01yOlvGJul*w 13 

IN 10 AOATCGATGOCAAAAAATTCGGTTAAGGCCAGGGGGAAAGAAAAAATATAAATTAAAACATATAOTATOOGCAAG AAA 

AftpArgTrpGlwly«n«ArQl*wAfQ^roGlyGlyly«Ly»lytTyrly«l«vlytNi »I l«V«lTr»AU3«r 31 

IN 1 0 CAGGOAGCTAGAACOATTCOCAGTT AATCCTGGCCT0TTAOAAACATCAOAAOOCT0TA0ACAAATACT0GOACA 32 1 

^ ArqGlulMGluApQfK«Al«ValA^roGlyLttul»uGlwThpS«pGIuGlyCyaroGlnU*l«wGIy01n A3 

IN ID GCTACAACCATCCCTTCAOACAGGATCAGAAGAACTTAGATCATTATATAATACAGTAGCAACCCTCTATTOTOT 394 

L«iGlftfro3ttPL«wGlfiThrGlyScrGluGUlMApQS«pl«uT v pA«nTnpV»lAl«ThPl«uTyrCytVal OA 

Ins — — — - 

Hind III 

IN 10 GCATCAAAGGATAGAGATAAAAGACACCAAGiiAAGCTTTAOACAAGATAGAGGAAGAGCAAAACAAAAOTAAGAA 47 1 

INS H *»°A'>A»-on»CluIUiy«A«pThrLy»GluAl«l«u«»ply«n«CluGluGloGlftA«rtLy»S«rlyilyt 113 

rVu II GAO #17 1 GAG #24 

IN 10 AAAAGCACAGCAAOCAGCAGCTGACACAGCACACAGCAGTCAGGrCAGCCAAAATTACCCTATAGTOCAGAACAT 744 

LyaUGlnCli%AI«AUAl«A»pThrGlyMi«S«rS«rGlfiV«IS«rGl*a«nTyprr«lUValGlnA«nIU 130 

Ah* III 

IN10 CCAGGGOCAAATGGTACATCAGGCCATATCACCTAGAACTTTAAATGCATGGGTAAAAGTAOTAOAAGAGAAGGC 021 

IMS ^l«Gl / GlnN«kV.lM|»GlfiAi.Il«S«r^r 0 ArQThrL»wA v >Al.TrpV,Uy«V.lV*lGl w Gl«LyaU 143 

Ah* IU 

IN10 TTTCAGCCCAGAAGTAATACCCATOTTTTCACCATTATCAGAAGOAGCCACCCCACAAGATTTAAACACCATOCT 504 

^ *K«S«rr>oGluV* 1 1 1 t h*S»r A 1 *L • w S«rGlwGl yA 1 *Thr f roGl*A»pt. *vA */>ThrK*t I mi HI 

I 

IN II AAACACAGTGGCGGGACATCAAGCAGCCATGCAAATOTTAAAAGAGACCATCAATGAGGAA6CTGCAGAATGGGA 97 1 

MJ _A^T»^V^GlyGlym»GlnAl*AUN«tGl«M«U M ^ 213 

IN 10 TAGAGTACATCCAGTGCATOCAGGGCCTATTGCACCAOGCCAGArGAGAGAACCAAOGGOAAGTGACATAOCAGG 1044 

Ar 4 V«lNi»»reV«lNl«Al«GlyrroU«AUProGlyGUIU«AraG!^peAfqOly$^Aft0liaUGly 230 

IN 10 AACTACTAGTACCCTTCAGOAACAAATAGGAT0GATGACAAATAATCCACCTATCCCA6TA0OA0AAATTT4TAA 1121 

ThrThr$«rThrl#uGlAGluGlnn«GlyTrpn«tThrAti%AtnPre^roU«fpoV«10iyGlvIl«Tyrly« 243 



IN 10 AAGATOGATAATCCTOOGATTAAATAAAATAGTAAGAATGTATAOCCCTACCAGCATTCTOGACATAAGACAAGO 1 194 
§NS _^?J^UIUlywGlyl#*A**lyilUV«lAr^ 201 
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BHIO AXCAAAAOAACCTTTTAOAOACTATOTAOACCOOTTCTAIAAAACTCTAAGAOCCGAOCAACC^ 1271 

AH 10 JUUAAAT T0OATOAC AQAAACt UOUOOTeCAAAATOCGAACCCAOATTOT AAOACT A? TIT AAAAOCATIOOO 1346 

IMS I^?i« Trv ^^«« L I ° , *** m * %A 1 «A ^P^o**^ v»t V«T^ 1 1 »wt r 1 *l •wGl v SSI 

ACCAOCOOCTACACTAOAAOAAATOATOACAOCATOTCAGCCAOTAGOACOACCCOOCCATAAGGCAACAOTTTT 1421 
INS .^;?i;?f;^ l ^ lw0 >"^*^ kThrA l» c V» 6 >«C»vV*IClv0lyffo0lvHUlyUUArqV*n«u 363 

•Nlf OGCTCMAOCAATCACCCAAOTAACAAATACAOCTACCA7AATOATCCAOAOAGOCAATTTTACCAACCAAAOAAA 1496 

INS .5!;?i^i;^J*» r0| " v * |ThrA ^ ThrA l» T ^n«^ St I 

SprThr 

•Hit OATOOTTAAGTOTTICAATIOTGCCAAAGAAGGGCACACAOCCAOAAATTOCAGGGCCCCTACGAAAAAOGGCTG 157 1 

. IU ly« Arg 

• Direct t«p«*« 4 | — f 01 lol II 

OHIO ttggaaatgtggaaaggaaggacaccaaatgaaagattotactgagagacaggctaattttttaggcaagatctg 1644 

Trpty»Cy»Gtyly«GluGlyNi »Gli>rWttyftAmpCvsThrGluAraGlfiAUA«APN«L«wGtylytIl«Trp 431 
|HJ m _ ft>«f h«ArQ6l uA»pl mu A 

•Hit GCCTTCCTACAAGOGAAGGCCAGGGAATTtTcTTCAGA^AGA 172 1 

froS«rT y rlysGlyAroProGlvA««irh«L«uGlnSttrAroProGluProThrAUPro^ro^h«l«uGlfvS«r 
INS A ^^ l ^ ClnC1 ^V^ 1 * Ar ^ lwr ^ SflrS * rC ^GlnfhrAroAUA»nS«rProThrM«S«rS«rGlu 

BHIO CAGACCAGA«CAACAGCCC^ 17,4 ... 

Ar^roCiuProThrAUProPr«GUGlvS«rPh«ArQS«rGivV*lClvThrThrThrrrprroClply 

©lnTh#ArQAUA*rtWProIhrAroArQ<Hul«^ 56 

ms I:: " 

l Mi 

GAG ptj 1 

IHIt GGAGCCGA1AGACAACGAACTGTATCCTTTAACTTCCCTCAGATCACTCTTTCGCAACGACCCCTCGTCACAATA 107 1 

GluProIUAtpty»Glul«vTyrPr©l«uThrS«rl««ArqS.rl^ 512 
GlyAWA»pAroCl*GlyThrV«lS«rPh^*nPh«ProGlnn^^ 01 
•H5 " 

1H10 AJK»TAGCC«K>CAACTAAAGCAAGCTCTATTACATACAG^ 1944 

ly»Il»ClyGiyClrtt«wly»GluAUl««l«uA»|»ThrGlyAl»A»pA»pthrV«ll#uGluClu««*S«rL«« 106 

AH 5 " " " — 

BHIO CCAOGAAGATGGAAACCAAAAATGATAGCGGCAATTGGAGGTTTTATCAAAGTAAG^AGTATWTCAGATACTC 2021 

ProClyArqTrpLy%Proty»H«tIl«ClyGlyIl»GlyClyPK«U«Ly»V»lArQClnT^ 131 
INS — — — — — 

BHIO ATAGAAATCTGTGGACATAAAGCTATAGGTACAGTATTAGTAGGACCTACACCT6TCAACA7AATTGCAACAAAT 2094 

Il«Clull«Cy»ClyHi»ly»Al»Il»GlyThrV*n«uV»|ClyProTHrProV«lAinIl»II«ClyAroA»n 156 
BHS - " 

BHIO CTGTTGACTCAGATTGGTTGCACTTTAAAT TTTCCCATTAGCCCTATTCAGACTGTACCAGTAAAATTAAA6CCA 2l7l 

l«vl«uThrGlnIl«CiyCy»Ihrl«vA»nPh«ProII«S»rProIl«GluThrV«lProV«nyll«wLy»Pro *»1 

•H5 — - t A 

BH10 GCAAIGGATGCCCCAAAAGTTAAACAATCCCCATTGACAGAAGAAAAAATAAAAGCATTAGTAGAAATTTGTACA 2244 

GlyfUtAftpGlyPr0lyftV«Uy>GlnTrpPrpl«uThrGluGluLy«IlttLy«AlaL«uV*lGluIl«CyiThr 206 

BHS - * 

,M10 GlJl^^lJly^GluGlyty^lI^ 251 
BHS A 

BHIO AAAAAAGACAGTACTAAATCCACAAAATTAGTACATTTCACAGAACTTAATAACAGAACTCAACACTTCTCGGAA 2596 
ly»ty*A»p5«rThrly*TrpArQLy*l«wV*lA*pPh«ArQGloL«uA»nly%ArqThrGlnA%pPh»TrpCl« 

IH5 -6 

Arg 

AH 10 GTTCAATTAGGAATACCACATCCCGCAGCGTTAAAAAAGAAAAAATCAGTAACAGTACTGCATGTGGGTGATGCA 247 1 

ValGlnL«uGlyIl«PreHisProAl*Glyl«uLymLyftlysLysS«rV«lThrV«U*vAspV»lGlyA«pAl« 20 1 

BH5 — G " 

BHIO TATT1TTCAGTTCCCTTAGATGAAGACTTCACCAAGTATACTGCATTTACCATACCTACTATAAACAATGAGACA 2544 

TyrPh«S*rV«lProL«uAftpGluA«pPh«ArolysTyrThrAUPh«ThrIUProS«rlUA«nAvtGluThr 
BH5 — T— 

BH10 CCACGGATIACATATCAGTACAATGTGCTTCCACAGGCATGCAAACGATCACCAGCAATATTCCAAAGTAGCATC 242 1 

ProGlyIl«AroTyrCInTyrA»nV»ll«uProCinClyTrply»Gly5«rProAl«Il»Ph^lnS«rS«rH«t SSI 

BHS G~G - 

5«rGly 

BHIO ACAAAAATCTTAGAGCCTTTTAAAAAACAAAATCCAGACATAGTTATCTATCAATACATGGATGATTTGTATGTA 2496 

ThrlyftII«L*uGluProPh«L^ftlysGlnA»nProAftpn«ValIUTyrGlnTyrH«(AftpAtpl«uTyry»l S56 
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SM19 •OATCTQACTTAOAAATAOOOCAOCATAOAICUMITAOAOOAOCTOAQACA^ 2771 

ftN It ACCACACCAOACMAAAACATaOAAAOAACCTCCATTCCTTTOOATOOOTU^ 1144 

fVv 11 

0« I 9 ACAOTACAOCCTATAOTOCTOCCAOAAAAAOACAOCTOOACTOTCAATOACATACAOAAGTTAOTOOOOAAATTO 2921 

ins I^!i?i-!™!!!i-™!^!!il^!?^I r * T ^ 1 
ill 

AA T T OOOCAAO TCAOA T T T ACCC AOOOAT T AAAOT AAOOC AA TT A TOT AA ACT CC T ? AOAOOAACC AAAOC AC T A 2991 
|M5 ?^!^i;^?*^ UT *{ fr »°WXl»ly»VftlAro01n^ 434 

9H 19 CATOOACTOTATTATOACCCATCAAAAOACTTAATAOCAOAAAT ACAOAA0CAOOOOCAA0OCCAAT6OACATAT 3 144 

N<«01yVaITyrTyrA4pProS«rly»At^l«u!I«AUOlutl401nlysOlfi0ly01ii01y0lATppThrTyr 304 

9H5 ••••••• ■»»«►«» 

Aha III 

Alt 19 CAAATTTATCAA0AGCCATTTAAAAATCTOAAAACA0OAAAATAT0CAAOAAT6A00COT0CCCACACTAATOAT 3221 

IH5 ?i^f; T y r0l ^ lMpr » Ph * L V»A»nl*uly«ThrOlyty«TvrAUAr^l«*ArqClyAl»MUThrA»«A»p 331 

•Mil OT AAA AC A AT T AA C A GAGOC ACT GC AA A AAA T AACC AC AO A A AOCA T ACT AA T A TOOOOAAACAC T CC T AA A t TT "m 

>h5 v;"!**?'!"!^!;*!;^ 354 

AH 11 AAACTACCCATACAAAAOGAAACATGGGAAACATOGTOOACAGAOTATTOQCAAOCCACCTGGATTCCTGAOTOO 337 f 

H49 IvilMPrelltflftlytflvTlvTrpGl^^ 311 

OAOTTTOTTAATACCCCTCCTTTAOTCAAATTATOGTACCAOTTAOAOAAAOAACCCATAGTAOOAOCAOAAACC 3444 
»H5 ^rMVtlA«nTlipProP^l«wValLy»l«uTrpTyH}lnl«iGlwLy«OtwProIlaV»101yAU61uThr 494 

3M14 TTCTATOTAGATOOGOCAOCTAACAGOOAOACTAAATTAGOAAAAOCAGQATATOTTACTAACAAAOOAAGACAA 332 1 

3«r Arg 

9M19 AACOTTOTCCCCCTAACTAACACAACAAATCAOAAAACTOAOTTACAAOCAATTTATCTAOCTTTOCAOOATTCA 3394 

. . ly«V«lV«irret«wThrA«nThrTKrA«AGlf«Ly«TKrOlvl«uGlnAl«lUTyrlauAUlM«01«iA«pS«r 434 

9H3 —A A C G A C 0 

Thp His Air 

9N19 OGATTAOAA0TAAACATAOTAACAOACTCACAATAT0CATTAOOAATCATTCAAGCACAACCA0ATAAAA0T0AA 3471 

IMS ?^^? iuV « 1A ^ I »« v *l T ^*«P 5 * r01ftT V r *l >l «« o lyn«Il«GlfiAl«01nPro»tply»S«rGlu 41 1 

9H 1 9 TCAGACTTAOTCAATCAAATAATAGAGCAOTTAATAAAAAAOOAAAAOGTCTATCTOOCATGoStACCAGCACAC 3744 

SMS ^^ 1 ^*uValA B nOlnIl«Il«GluGlnL«uIl«Lytly«OluLyiValTyrl«uAl«TrpV«lProAlaMit 791 

•Hi 9 AAAOOAATTOOAOGAAATOAACAAOTAOATAAATTAOTCAOTOCTOGAATCAOGAAAATACTATTTTTAOATOOA 392 1 

9H19 A T AOA T A AOOCCC A AGA T OAACA T GAG AAA T A TCACAOT AA T TOO AOAGC AA TOOCT AOT OA T T T T AACC TOCCA 3994 

PVw t] 

9N19 CCTOTAOTAOCAAAAOAAATAGTAOCCAOCTOTOATAAATOTCAOCTAAAAOOAGAAOCCATOCATOOACAAOTA 3971 

sms -!?!!il!!i?i!il!?i-fi!! # i*i*?! rCva ^ ly<v ^ lal ^ ly ^ 1 ^ lM * UH- * Mt * cly ? lnV * 1 741 

ANI9 OACTOTAOTCCAOOAATATOOCAACTAOATTOTACACATTTAOAAOOAAAAOTTATCCTOOTAOCAOTTCATOTA 4944 

9N19 ^CAOTOOATATATAOAAOCAOAAOTTATTCCAOCAOAAACAOOOCAOOAAACAOCATATTTTCTTT^AAAATTA 4121 

SMS ?i!?^ivTyrn«OiwAl^lwV»in«Pr^AUGlwT^r01y01«01wT^Al # T y rPh«l«s,t»wlvil #w 43 1 

•NI4 OC A OOA AOA T OOCC AOT A AA AAC A AT ACAT AC AOACAAT OOCAOC AA TT TCACC AO TOCT ACOO T T AA OOCCOCC 4194 
•HI4 TOTTOGTOO0C00OAATCAAOCA0OAATTTO&ATTCCCTACAATCCCCAAAOTCAA0OA0TA0TAOA 427 1 

9H 14 AATAAAOAATTAAAOAAAATTATAOOACAOOTAAOAOATCAOOCTOAACATCTTAAOACAKAOTACAAATOOCA 4344 

sms ™i!!?^-i^^-^!i^!Ii!^^!?i^?i!!^!i^C?!!??^™* 01w>,, »t •uiy»Thr ai • v« i oil*** ai • 904 



FIG. 3 (can.) 
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1*10 •TATKMCCAUMmMUoAA^ Mt1 

MS * 

IM10 iCAACAOACATACAAACTAAA<UATTACAAAAACA^^ MM 

Mil AOTOACATAAAAOTAOTOCCAJWAOAAAAOCAAAOATCATTAOOOATTATO^^ 4444 
fOl 1 

IN i • TOTOT0OCAA<nAOACA0OATOA<KUTTAOAACAT00AAAA0TTTA0TAAAACACCATAT0TAT0TTTCA0O0AA 472 1 

C 5!l^^l^lA^r^Aro??^ « 

||f} ••••••••••••••••••••••••••••••• • • *Q 

Ar* 

IN II A0CTAOO0OATO0TTTTATAWCATCACTATOAAA0CCCTCAICCAA(MUTAAOTTCA0AA0TACACATCCCACt 4794 

wj AUArqOlyTrprWyrArgm»NliTyrOlgS«rrroN1»Fr©ArQn^ 70 

IN i« AOOOCATOCTAOATTOOTAATAACAACATATTOOOCTCTOCATACAOOAOAAAOAOACtOOCATTTOOOTCAOOO 417 1 

^ OlyAipAlaAralMVtlllftThrThrTyrTrpOlyiauNUT^ 13 

IN II A0TCTCCATAMATOOA0OAAAJUOAOATATA<^ 4944 
INS 



V«lS«rIl««lyTr|>ArQly*lgargTyrWThrOl«VtU»p^ 120 

IN II OTATTACTTTOACT0TTTTTCAOACTCT0CTATAA0AAA0OCCTTATTA00ACACATA0TTA0CCCTAO0TCT0A 9121 
INI TyrT V rPK*A»^y»fK«J«rA.p5«rAun»ArQLy»AUL«wl»uClyH««Il«V«lS«rrroAroCy«Olu 

IN 10 ATATCAAOCAO0ACATAACAAOOTAOOATCTCTACAATACTTOOCACTAOCAOCATTAATAACACCAAAAAA0AT 5094 
^ Tyr01i«AU01yNlftA«nLy«V«l01yS«rL«u01ftTyrL«wAl«t«iAUAl»lMtl«ThrPr«Ly»ly«ll« 

IN II A A AOCC ACC T T TOCCTAGTQTT ACOA A ACT OA C AOAOCAT A G A T COA AC AAOCCCC AOAAOA CC AAOOOCC AC AO 317 1 

ly«ProfreiftuProS«rVftlThrty«l«uThr01wA»pAroTrpAahLy»fre01fily«Thrlyi01yNI«Arg 103 

301 I 

IN 10 AOCOAGCCACACAATCAATOOACACTAOACCTTTTAOAOOACCTTAAOAATGAAOCTOTTAOACATTTTCCTAOO 3244 
IMS 



IN 10 ATTTOOCTCCATOCCTTAOOOCAACATATCTATOAAACTTATOGOOATACTTOOOCAOOAOTOMAOCCATAATA 5321 

IMS - - 

IN 10 AcStTCTOCAACAACTOCTOTTTATCCATTTTCAOAATtOCOTOTCCACATAOCAOAATAOOCOTTACTCOACA 5394 

IMS A— - 

IN 10 0AOOAOAOCAAOAAATOOAOCCAO7AOATCCTAOACTAOAOCCCTOGAAOCATCCACOAA6TCAOCCTAAAACTO 547 1 

INS - 

IN 10 CTTOTACCAATTOCTATTGTAAAAAOTOTTOCTTTCATTOCCAAGTTTOTTTCATAACAAAAOCCTTAOOCATCT 5544 

IMS -C-— ™ 

ims --I^™!™?.™^"--^"™*^I" TCM ^*° TCACACTCATCAA0 TTCTCTATCAA 5421 

INI --0---A--G-A 

IN10 AGCA0TAACTA0TACATGTM7OCAACCTATACAAATAOCAATAGTA0CAC )Tt AOT AOTAOCAATAAT AATAOCAA 5494 

IKI -C -C T--C-A-— T-OCC C 



CUV-lOft 



F/G.3mv 



143 
170 



0iyWrMUThrN*U»*01yM»« _ _ 205 



M II ftACTAATAOAUOAOCAOAAOACAOTOOCAATOAQAOTOAAOQAOAAATATCAOCACTTOTOOAOATOOOOCTOO 5144 

iys01«01nlysThrV«lAUM«4Ar«V«llytGlwly»Tyr01fiN1sL«yTrpArtTrpClyTrp 22 

Ml 

IN 10 AOAT0OOOCACCATOCTCCTTOO0AT0TT0ATOATCT0TA0T0CTACAGAAAAATT0TOOOTCACAOTCTATTAT 5921 

AroTrp01yThrM«%t«ui«^ly^l«uVMIl«Cy»3ttrAUThp01yiy«L«uTr»V«lThrV»lTyrT|r 47 

IN 10 OOgStaCC70I6TOOAA0OAAOCAACCACCACTCTATTTT0TGCATCA0ATOCTAAAOCATATOATACAOAG0TA 5994 

6|yV»lPreV«lTrpiya01uAlaTMTHrTKrL«ifh^yai»S*f-AsoAl*ly«Al9TypA«pThr01wVal 72 
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•Mil ttTWOmOOOCCACACATGCCTOTOTACCCACAAiCCCCAACrc^ 4471 

•Nit GAAAA T T TT A AC A TOT OOAAAA A T OACA T |GT AOAACAGATOCA TOAOGAT A T AA TCAGT TT A TGOOATCAA AOC 4 144 
* ' ......... 

•Hit €T AAA (KCATOTOTAAAATTAACtCCACtCTOTITTAOTTTAAAOTOCACTtATTTOAAOAATOA TACT AATACC 4*2 1 
|H| ^^y; p ^ c V« v *^y^^ T ^^L«uCy«V»lS«rLMly«Cy«ThrAc#lwlva^M^AMThr 

■ —————— H 

IN II AATAOTAOTAOCOOOA(UAT(UTAAT(X>AOAAAOOAOA(UTAAAAAACTOCTCTTTCAATATCAOCACAAOCATA 1 244 
lid A^*£ s «'*«rflvA'^U4rWk01wlyg01yGl«Il*ya^ 

.... 

•Mil AOAOOTAAOOTOCAOAAAOAATATOCATTTTTTTATAAACTTOATATAAf ACCAATAGATAATGATACTACCAOC 417 1 

in* — ??i*-*— !i?i"il!?i* TvrAuf ^^ T ^^ 
■ 

•Hit TATAC0TTOACAAOTT9TAACACCTCA0TCATTACACA0OCCT0TCCAAA00TATCCTTT0AOCCAATTCCCATA 4444 

TyrThrLMThrS«rCya»nThrS«rV«lII«Thr01faUCysProLytV*l$«rPK^luPpo!l^re!l« 

•Hi ....•••....«....••.....•..•.•••...•••.•.••••.•...••............••.......... 

■ ■ 

iMIl CATTATTOTOCCCCOOCTOOTTTTOCOATTCTAAAATOTAATAATAAOACOTTCAATOOAACAOOACCATOTACA 4521 
Nf»Tyi<yai*ProAUOlyPh^un*lBwlytCya«nAwvly»ThrPK«AsnOlyTKrOlyfroCy»rhr 

■ ■ 

BH1I AATOTCAOCACAOTACAATOTACACATOOAATTAOOCCAOTAOTATCAACTCAACTOCTOTTAAATOOCAOTCTO 4594 
|N| AMV^l^^Tt^^VAlOl^CwTrk^Mi »01 y X t »A^^ V» 1 V » 1 T»%p-0 1 L Ml. 9uA M 01 y mj 

Iql II m~ 

•M II OCAOAAOAAOAOOTAOTAATTACATCTOCCAATTTCACAOACAATOCTAAAACCAT AATAOTACAOCTOAACCAA 447 1 
m AltCIuCluGIwV^lVAlIiareS^AUAtft^ThrAtpAaftAULytThrlUIUVAlOlnLMAafiGlM 

•Hi T 0 0— ACr 

V«l A»»Thr 

IH It TCTOTACAAATTAATTOTACAAOACCCAACUCAAT ACAAOAAAAAOTATCCOTATCCAOAOAOOACCAOOOACA 4744 
INI V»X01 w£ 1 ^A«^Cy*TWr A«-of^p-oA aaA A «n T»%«- A«- y »S «^ 1 1 *A^ 0 Z 1 *01 nAr^Ol y^#-^o01 y*«^o 

IM II 0CATTTOTTACAATAOGAAAAATAOOAAATAT0A0ACAA0CACATT0TAACATTA0TA0A0CAAAAT00AATUC 4121 
Al»Ph«V«lTKrII#01ytyiIl«01yA.«^ArgCl«Al»M««C y »A«r»n«S«rAroAUly«TrpA,oA«n 

»*• - — oc- 

AK. Ill . AU 

iHit ACTTTAAAACAOATAOATA0CAAATTAAOAOAACAATTT0OAAATAATAAAACAATAATCTTTAAOCA0TCCTCA 4494 
. t Thrl«wLy«01nXl«A«pS«rLy«l«uApg0Iw01fiP»wOlyAanA«nLy»ThrIUU«Ph*lytOinS«rS«r 

INI -----•----..--«..-.-..--...--.------.--.---....----——....-....«..._...-. 

•Hit OOA0O0CACCCAOAAATTGTAAC0CACA0TTTTAATTGT06AOO0GAATTTTTCTACT0TAATTCAACACAACT0 4971 
INI Il^Yi iI'^'' M 1 »*«*^ yO 2v»^l^»^I^Tyf-Cy»A Tt>«-OXrtl. 

■ ■ I 

■Nit TTTAATAGTACTTOGTTTAATAGTACTTOGAGTACTAAAOGOTCAAATAACACTOAAOOAAOTOACACAATCACC 7t44 

INI '^*^ $#rThrTr *J^» S « rT ^ T rj*«rfHrt*rfW5«rAifU 

IHit CTCCCATGCAGAATAAAACAAATTATAAACATOTOOCAGGAAGTAOGAAAAOCAATOTATOCCCCTCCCATCAGT 7121 

^ l»urroCyar Q IULy»Gun«IUA«ftlW*TrpCl«01wV»lClyLy«Al«IU*TyrAl»rp©rroIU$«r 

■ « 
IHit OOACAAATTAOATOTTCATCAAAIATTACAOOOC'GCTATTAACAACAGATOOTGOTAATAOCAACAATOAOTCC 7 194 
OlyGlntlttApaCyftSttrS«rAsAXl«ThpGlyl«uL«uL«uThrApqAftp01yGlyAwiS«pAifiA««OluS«r 



IqI II 

iHit OAGATCTTCAOACCTGOAOGA0GAGATATOAO00ACAATT0OA0AA0T6AATTATATAAATATAAAOTA0TAAAA 727 1 
w 01uIX^hap^reCIy«lyGlyAtpH««AroA»pAtnTppArQS«rGlwl«uTypLy»Tyrly»V«lV«Uy* 

\/ 

IHit ATTMACCATTAOOAOTAOCACCCACCAAOOCAAAOAOAAGAOTOGTOCAOAOAOAAAAAAOAOCAOTOOGAATA 7344 
bm n«GluFpoL«u01yV«lAUFpoThrly»AlaLy«ArQArQV«lV»10lMArq01wLy»AroAUV«101y2U 

IH • • OOAOCTTI0TTCCTTOOOTTCTTOGGAOCA0CAGGAA0CACTAT00OC0CAOC0TCAAT0AC0CTGAC00TACA0 742 1 
IN* OjyAUL M ^h«L«uGlyrMlawGIyAUAl«GlvS«rThrH««01yA2aAl«5ttHW«ThrL«wThrVB]Olii 

•Hit OCCAOACAATTATTGTCTOOTATAOTOCAOCAOCAOAACAATTTOCTOAOOOCTATTOAOOCOCAACAOCATCTO 7444 
INI f;*;^a0' n l;"l^3;rtlvIl«V«lOlnOlnOlR^^ 

^ ..... 

•M It TTGCAAC TCACAGTCTOGOOCATCAAOC AXTCCAOOC AAOAATCCTGOCTOTIGAAAGAT ACCTAAAOOATCAA 757 1 
^ L«iOlnLttuThrVa]TrpOlyIl«ly«OlPiL«uGUAUJrQtUL«uAlaValOluApoTyrLMlyftAt0GU 

■ 

•Hit CAOCTCCTGOOOATTTQOOOTTGCTCTOOAAAACTCATTTOCACCACTOCTOTOCCTTOGAATOCTAGTTOOAOT 7444 

0lnl«il«ii01yIl«Trp0]yCy«Sttr01yly«L«wIl«CysThrThrAUVAlFroTppAw«Al s 5srTr9S«r 

•HI — — .............. . .............. . — — . — 
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■ ■ ■ m*4 til 

IN 1 1 AATAAATCTCTOOAACAOAT TTOOAAT A AC AT OACCT0OAT0OAOTO0OACA0A0AAAT TMCAATTAUCAAOC ? 71 I 

M 1« TTAATACACTCtTTAATTOAAOAATCOCAAAACCACKAACUUUAOAATOAACAAOAATTATTOOAATTAOATAAA 7794 

l«utl*N(tS«flttull«OluaiuS«r0lAAw%0lA0liiOlvly^«iiOivOU0ia«ifl«w01wl«i^aply« 472 

■ 

•Mil TOOOCAAOTTTOTOOAATTOOTTTAACATAACAAATTOOCTOTOOTATATAAAATTATTCATAATOATAOTAOOA 747 1 

mm !C^i!f?T LMTr ^ MTr ^^ mnaT ^ A ^ 7r ^^ Tr9TrrI1 * iv * L#uPb#I1 * ,l#%ZUVftl02y * fT 

Mil OOCTTOOTAOOT TTAAOAAT AOTTTTTOCTOT ACTTTCT0TAOTOAATA0AOTTA0OCA30O4TATTCACCATTA 7944 

m 0l V l ^ V » l0l y l ^* r on«V»lfK^l.V.U«w5»rV«lV.lA»«ArgV«lArgOlnOlyTyrS«rPr«l«u 722 

iu 

IN II TCOTTTCAOACCCACCTCCCAATCCCOAOOOOACCCOACAOOCCC^ 112 f 

M ^'N^UTKrNisl«ur^n«^roAeo6ly^roAspA^ro01ii01vtl«61uOlwOluOlyOlyGIuAr« 747 

A«n 

IN II <UCAOAOACAOATCCATTCOATTAOTOAACOQATCCTTAQCACtTATCTOOQACOATCTQCQQAQCCTOTOCCTC 1094 

Af^r^^roS«rIiarol«vV«lA«A01vS«rL«uAUl«uIl«Tr»Aft»AtpLttuArQS«rl«uCyBl«g 772 

IH 19 TTCA(KTACCACCOCTT(MGAOACTTACTCTTOATTOTAACOAOOATTOTO(MACTTCTOOOACOCAOOOOGTOO 1 17 1 

iH | ^^Yv^^^^^oAspltwtMtvwXUVAlThrArolUValOIwlwit^vOlyAr^AroGlyTrp 7f7 

%* II OAAOCCCTCAAATATTOOTOOAATCTCCTACAOT ATTOOAOTCAOOAOCTAAAOAATAOTOCTOTTAQCTTOCTC 1244 

^ OlwAl»l«uly«TyrTrpTrpAaoL»ul»uClnTyrTrpS«H>ln01yL«wly«A^S*rAi«V«lS«rL«wl«u 122 

■ 

IN II AATOCCACAOCTATAOCAOTAOCTOAOOOOACAOATAOOOTTATACAAOTAOTACAAOOAOCTTATAOAOCTATT 132 1 

AMAlaThrAUIl«AUValAl»OlwGlyThrAsoAroV«lIl«01tfV«lV»101f«01yAUTyrAroAl*ll« 147 
PHI • ------ -C" 1 " 

t«u Ala 

CNV-lOft I 

IN II COCCACATACCTAOAAOAATAAOACAOGOCTTOOAAA0GATTTTOCTATAAOATO0OTOOCAA0TOOTCAAAAAG 1394 

M il?? 1 !i 1 •* r * Ar O* r «* 1 *Ar gOlnOlyl ftvOluArgl Ul «ul «i I A S 

JJHI TAOTOTHTTOGATGOCCTOCTGTAAOGGAAACUATa^ 147 1 

INII ATCTCOAOACCTAOAAAAACATOOAOCAATCACAAOTAOCAACACAOCAOCTAACAATOCTOATTOTOCCTOCCT 1544 

•HA ^ T C C T 

ftp* 1 

IjUl A0AAOCACAACAO0ACCA0CAO0TOO0TTTTCCA0TCACACCTCA0CTACCTTTAACACCAAT0ACTTACAAOCC 14? t 

FvuII lol II Aha III | U3 

Polypurlna Tract II 

WHt9 A^TOTAOATCTTAOCCACTTTTTAAAAOAAAAOCOOOGACTC<^COOCTAATTCACTCCCAAC&AAOACAAOA 1414 
(1m MI) 

IN II T ATCCTTOA TCTO TOGA TC T ACC ACACAC 
AMI — 

Jjni CAOATATCCACTOACCTTTOOATOOTOCTACAAOCTAOTACCAOTTGA(KCAOAGAAGTTAOAAIMAOCCAA 4144 

IMA * A^GACAACACCA(KTTGTTACAXCCT0TQA(KCTOCATOGAAT(XMT0ACCCG0ACA(U(M 1921 

INII OAOOTTTGACAGCCOCCTAGCATTTCATCACATOGCCCOAOAGCTOCATCCOOAGTACTTCAAOAACTOCTOACA 1994 
|HA - — * ---- .................... j- 

iNt I TCOAOCTTOCT ACAAOOGACTTTCCOCTOOOOACmCCAOGGAOOCOTOOCCTOOGCOOGM 917 1 

Pvu II U3 — I — I IqI II 

JJJ^I A^C^C ^^^^^ Z * T*^?^*^^ ▼ OCT TT TTOCOTOT ACT OOO TOT C TCTOOT T A OACCAOA T CTOnAOCCT 9144 

HIM III 

NXI2 T€TOOCTAOCTAOOGAAC4CACTO<TTAAOC€TcXaTAaJIo2tTOCCTTOA 9211 

KXI2 ag taotot otocccotctottototqactctootaactacaoatccctcaoa 

NYI> CRCTTTTAATttrtTrtTftrUAAATr.TCTAtflTA 
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FIG. 4 
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FIG. 8 
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FIG. 9 
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